WORLD INTELLECTUAL PROPERTY ORGANIZATION 
Intemauonal Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 7 : 




(11) International Publication Number: 


WO 00/18800 


C07K 14/705, C12N 5/10, 15/62, 15/12, 
15/63 


Al 


(43) International Publication Date: 


6 April 2000 (06.04.00) 



(21) International Application Number: PCT/US99/22818 

(22) International Filing Date: 30 September 1999 (30.09.99) 



(30) Priority Data: 

09/163,523 



30 September 1998 (30.09.98) US 



(63) Related by ConUnuation (CON) or Continuation-in-Part 
(CIP) to Earlier Application 

US 09/163,523 (CON) 

Filed on 30 September 1998 (30.09.98) 



(71) Applicant (for all designated States except US): MILLEN- 

NIUM B IOTHER APEUT1CS , INC. [US/USJ; 620 Memo- 
rial Drive, Cambridge, MA 02139 (US). 

(72) Inventor; and 

(75) Inventor/Applicant (for US only): BUSFIELD, Samantha, J. 
[AU/US]; Apartment #1,15 Trowbridge Street, Cambridge, 
MA 02138 (US). 

(74) Agent: MEIKLEJOHN, Anita, L.; Fish & Richardson, P.C., 
225 Franklin Street, Boston, MA 02110-2804 (US). 



(81) Designated States: AE, AL, AM, AT, AU. AZ, BA, BB, BG, 
BR, BY, CA, CH, CN, CR, CU, CZ, DE, DK, DM, EE, 
ES, FI, GB, GD, GE, GH, GM, HR, HU, ID. IL, IN, IS, JP, 
KE, KG, KP, KR, KZ. LC, LK, LR, LS, LT, LU. LV, MD, 
MG, MK, MN, MW, MX, NO, NZ, PL, PT, RO, RU, SD, 
SE, SG, SI, SK, SL, TJ, TM, TR, TT, TZ, UA, UG, US. 
UZ, VN. YU, ZA, ZW, ARIPO patent (GH, GM, KE, LS, 
MW, SD, SL, SZ, TZ, UG, ZW), Eurasian patent (AM, AZ, 
BY, KG, KZ, MD, RU, TJ, TM), European patent (AT, BE, 
CH, CY, DE, DK, ES. FI, FR. GB, GR, IE. IT, LU, MC 
NL, PT, SE), OAPI patent (BF, BJ, CF, CG, CI, CM, GA, 
GN, GW, ML, MR, NE, SN, TD. TG). 



Published 

With international search report 

Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(54) Title: NOVEL SECRETED IMMUNOMODULATORY PROTEINS AND USES THEREOF 
(57) Abstract 

The invention concerns cDNA molecules encoding TANGO 191 and TANGO 195, both of which are transmembrane proteins. 
The nucleic acids and polypeptides of the present invention are useful as modulating agents in regulating a vanety of cellular processes. 
Accordingly in one aspect, this invention provides isolated nucleic acid molecules encoding a polypeptide of the invention or biologically 
active portion thereof. The present invention also provides nucleic acid molecules which are suitable as primers or hybridization probes for 
the detection of nucleic acids encoding a polypeptide of the invention. 



FOR THE PURPOSES OF INFORMATION ONLY 
Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spam 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


A2 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


!L 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


uz 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


zw 


Zimbabwe 


ci 


Cote d'lvoire 


KF 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


FT 


Portugal 






CU 


Cuba 


KZ 


Kazaksun 


RO 


Romania 






CZ 


Czech Republic 


LC 


Saint Lucia 


RU . 


Russian Federation 






DE 


Germany 


LI 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







WO 00/18800 



PCT/US99/22818 



- 2 - 

Summary of the Invention 
The present invention is based, at least in part, 
on the discovery of cDNA molecules encoding TANGO 191 and 
TANGO 195, both of which are transmembrane proteins. 
5 These proteins, fragments, derivatives, and variants 
thereof are collectively referred to as "polypeptides of 
the invention" or "proteins of the invention." Nucleic 
acid molecules encoding polypeptides of the invention are 
collectively referred to as "nucleic acids of the 

10 invention . " 

The nucleic acids and polypeptides of the present 
invention are useful as modulating agents in regulating a 
variety of cellular processes. Accordingly, in one 
aspect, this invention provides isolated nucleic acid 

15 molecules encoding a polypeptide of the invention or a 
biologically active portion thereof. The present 
invention also provides nucleic acid molecules which are 
suitable as primers or hybridization probes for the 
detection of nucleic acids encoding a polypeptide of the 

2 0 invention. 

The invention features nucleic acid molecules 
which are at least about 45% (or 55%, 65%, 75%, 35%, 95%, 
or 98%) identical to the nucleotide sequence shown in SEC 
ID NO: 1, 3, 4, 6 or , or the nucleotide sequence of 

25 the cDNA insert of either the clone deposited with the 
American Type Culture Collection, Manassas, VA (ATCC) as 

Accession Number or the clone deposited with the 

ATCC as Accession Number (the " cDNA of ATCC " 

or the "cDNA of ATCC "), or a complement thereof. 

30 The invention features nucleic acid molecules 

which include a fragment of at least 30C (325, 350, 375, 
400, 425, 450, 500, 55 0, 600, 650, 70C, 800, 90C, 1000, 
or 1200) nucleotides of the nucleotide sequence of SEQ 
I D NO : 1 , 3, 4, 6 or or the nucleotide sequence of the 
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NOVEL SECRETED IMMUNOMODULATORY PROTEINS 
AND USES THEREOF 



10 Background of the Invention 

Many secreted proteins, for example, cytokines and 
cytokine receptors , play a vital role in the regulation 
of cell growth, cell differentiation, and a variety of 
specific cellular responses. A number of medically 

15 useful proteins, including erythropoietin, granulocyte- 
macrophage colony stimulating factor, human growth 
hormone, and various interleukins , are secreted proteins. 
Thus, an important goal in the design and development of 
new therapies is the identification and characterization 

20 of secreted proteins and the genes which encode them. 

Many secreted proteins are receptors which bind a 
ligand and transduce an intracellular signal, leading to 
a variety of cellular responses. The identification and 
characterization of such a receptor enables one to 

25 identify both the ligands which bind to the receptor and 
the intracellular molecules and signal transduction 
pathways associated with the receptor, permitting one to 
identify or design modulators of receptor activity, e.g., 
receptor agonists or antagonists and modulators of signal 

30 transduction. 
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Also within the invention are isolated 
polypeptides or proteins which are encoded by a nucleic 
acid molecule having a nucleotide sequence that is at 
least about 65%, preferably 75%, 85%, or 95% identical to 

a nucleic acid sequence encoding SEQ ID NO: 2, 5 or ; 

and isolated polypeptides or proteins which are encoded 
by a nucleic acid molecule having a nucleotide sequence 
which hybridizes under stringent hybridization conditions 
to a nucleic acid molecule having the nucleotide sequence 

of SEQ ID NO : 1 , 3, 4, 6 or a complement thereof or 

the non- coding strand of the cDNA of ATCC or the 

cDNA of ATCC . 

Also within the invention are polypeptides which 
are a naturally occurring allelic variants of a 
polypeptide that includes the amino acid sequence of SEQ 

ID NO: 2, 5 or an amino acid sequence encoded by the 

cDNA of ATCC , or an amino acid sequence encoded by 

the cDNA of ATCC , wherein the polypeptide is 

encoded by a nucleic acid molecule which hybridizes to a 
nucleic acid molecule having the sequence of SEQ ID NO: 

l t 2, 4, 6 or or a complement thereof under stringent 

conditions . 

Tr.e invention also features nucleic acid molecules 
thac hybridize under stringent conditions to a nucleic 
acid molecule having the nucleotide sequence of SEQ ID 

NO;l, 3, 4, or 6, the cDNA of ATCC or the cDNA of 

ATCC , or a complement thereof. In other 

embodiments, the nucleic acid molecules are at least 300 
(325, 350, 375, 400, 425, 450, 500, 550, 600, 650, 700, 
80C. 90C, 1000, or 1290) nucleotides in length and 
hybridizes under stringent conditions to a nucleic acid 
molecule having the nucleotide sequence of SEQ ID NO:l, 

or tne cDNA ATCC , or the cDNA of ATCC 

cr a complement thereof. 
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cDNA of ATCC or the cDNA of ATCC , or a 

complement thereof . 

The invention also features nucleic acid molecules 
which include a nucleotide sequence encoding a protein 
5 having an amino acid sequence that is at least about 45% 
(or 55%, 65%, 75%, 85%, 95%, or 98%) identical to the 

amino acid sequence of SEQ ID NO: 2, 5 or or the amino 

acid sequence encoded by the cDNA of ATCC or the 

cDNA of ATCC , or a complement thereof. 

10 In preferred embodiments, the nucleic acid 

molecules have the nucleotide sequence of SEQ ID NO: l, 

3, 4, 6 or or the nucleotide sequence of the cDNA of 

ATCC , or the cDNA of ATCC . 

Also within the invention are nucleic acid 

15 molecules which encode a fragment of a polypeptide having 
the amino acid sequence of SEQ ID NO: 2 or 5 the fragment 
including at least 15 (25, 30, 50, 100, 150, 300, or 400) 
contiguous amino acids of SEQ ID NO: 2 or 5, the 
polypeptide encoded by the cDNA of ATCC , or the 

20 polypeptide encoded by the cDNA of ATCC 

The invention includes nucleic acid molecules 
which encode a naturally occurring allelic variant of a 
polypeptide comprising the amino acid sequence of SEQ ID 
NO: 2.. 5 or the amino acid sequence encoded by the 

25 cDNA of ATCC , or the amino acid sequence encoded by 

the cDNA of ATCC , wherein the nucleic acid molecule 

hybridizes to a nucleic acid molecule having a nucleic 
acid sequence encoding SEQ ID NO: 2 , 5 or or a 

complement thereof under stringent conditions. 

30 Also within the invention are isolated 

polypeptides or proteins naving an amino acid sequence 
that is a: least about 65% , m preferably 75%, 85%, 95%, or 
98% identical to the amino acid sequence of SEQ ID NO: 2. 
5 or 
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polypeptide; (2) the ability to bind a ligand of the 
naturally-occurring polypeptide/ (3) the ability to bind 
to an intracellular target of the naturally-occurring 
polypeptide. Other activities include: (1) the ability 
5 to modulate cellular proliferation; (2) the ability to 
modulate cellular differentiation; and (3) the ability to 
modulate cell death. 

In one embodiment, a polypeptide of the invention 
has an amino acid sequence sufficiently identical to at 

10 least one domain of a polypeptide of the invention. As 
used herein, the term "sufficiently identical" refers to 
a first amino acid or nucleotide sequence which contains 
a sufficient or minimum number of identical or equivalent 
(e.g., with a similar side chain) amino acid residues or 

15 nucleotides to a second amino acid or nucleotide sequence 
such that the first and second amino acid or nucleotide 
sequences have a common structural domain and/or common 
functional activity. For example, amino acid or 
nucleotide sequences which contain a common structural 

20 domain having about 65% identity, preferably 75% 

identity, more preferably 85%, 95%, or 98% identity are 
defined herein as sufficiently identical. 

In one embodiment, the isolated polypeptide lacks 
both a transmembrane and a cytoplasmic domain. In 

25 another embodiment the polypeptide lacks both a 

transmembrane domain and a cytoplasmic domain and is 
soluble under physiological conditions. 

The polypeptides of the present invention, or 
biologically active portions thereof, can be operably 

30 linked to a heterologous amino acid sequence to form a 
fusion protein. The invention further features 
antibodies that specifically bind a polypeptide of the 
invention such as monoclonal or polyclonal antibodies. 
In addition, the polypeptides of the invention or 

35 biologically active portions thereof can be incorporated 
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In preferred embodiments, the isolated nucleic 
acid molecules encode a cytoplasmic (SEQ ID NO-.ll or 16), 
transmembrane (SEQ ID NO:10 or 15), or extracellular (SEQ 
ID NO: 9 or 14) domain of a polypeptide of the invention 
5 or a complement thereof. In another embodiment, the 
invention provides an isolated nucleic acid molecule 
which is antisense to the coding strand of a nucleic acid 
of the invention. 

Another aspect of the invention provides vectors, 

10 e.g., recombinant expression vectors, comprising a 
nucleic acid molecule of the invention. In another 
embodiment, the invention provides host cells containing 
such a vector. The invention also provides methods for 
producing a polypeptide of the invention by culturmg, in 

15 a suitable medium, a host cell of the invention 

containing a recombinant expression vector such that a 
the polypeptide is produced. 

Another aspect of this invention features isolated 
or recombinant proteins and polypeptides of the 

20 invention. Preferred proteins and polypeptides possess 
at least one biological activity possessed by the 
corresponding naturally-occurring human polypeptide. An 
activity, a biological activity, and a functional 
activity of a polypeptide of the invention refers to an 

25 activity exerted by a protein, polypeptide or nucleic 
acid molecule of the invention on, for example, a 
responsive cell as determined in vivo, or in vitro, 
according to standard techniques. Such activities can be 
a direct activity, such as an association with or an 

30 enzymatic activity on a second protein or an indirect 

activity, such as a cellular signaling activity mediated 
by interaction of the protein with a second protein. 
Thus, such activities include, e.g., (1! the ability to 
form protein : protein interactions with proteins in the 

35 signaling pathway of the naturally-occurring 
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nucleic acid of the invention. In other embodiments, the 
modulator is a peptide, pept idomimet ic , or other small 
molecule . 

The present invention also provides diagnostic 
5 assays for identifying the presence or absence of a 

genetic lesion or mutation characterized by at least one 
of: (i) aberrant modification or mutation of a gene 
encoding a polypeptide of the invention, (ii) mis- 
regulation of a gene encoding a polypeptide of the 

10 invention, and (iii) aberrant post- translational 

modification of a protein of the invention wherein a 
wild- type form of the gene encodes a protein having the 
activity of the protein of the invention. 

In another aspect, the invention provides a 

15 method fcr identifying a compound that binds to or 

modulates the activity of a polypeptide of the invention. 
In general, such methods entail measuring a biological 
activity of the polypeptide in the presence and absence 
of a test compound and identifying those compounds which 

20 alter the activity of the polypeptide. 

The invention also features methods for 
identifying a compound which modulates the expression of 
a polypeptide or nucleic acid of the invention by 
measuring the expression of the polypeptide or nucleic 

2 5 acid in the presence and absence of the compound. 

Other features and advantages of the invention 
will be apparent from the following detailed description 
and claims. 

Brief Description of the Drawings 
30 Figure 1 depicts the cDNA sequence (SEQ ID NO:li 

and predicted amino acid sequence (SEQ ID NO: 2) of human 
TANGC 1S1. The open reading frame cf SEQ ID NO : 1 extends 
from nucleotide 557 to 2353, inclusive (SEQ ID NO : 3 ) . 
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into pharmaceutical compositions, which optionally 
include pharmaceutical^ acceptable carriers. 

In another aspect, the present invention provides 
methods for detecting the presence of the activity or 
5 expression of a polypeptide of the invention in a 

biological sample by contacting the biological sample 
with an agent capable of detecting an indicator of 
activity such that the presence of activity is detected 
in the biological sample. 

10 In another aspect, the invention provides methods 

for modulating activity of a polypeptide of the invention 
comprising contacting a cell with an agent that modulates 
(inhibits or stimulates) the activity or expression of a 
polypeptide of the invention such that activity or 

15 expression in the cell is modulated. In one embodiment, 
the agent is an antibody that specifically binds to a 
polypeptide of the invention. 

In another embodiment, the agent modulates 
expression of a polypeptide of the invention by 

20 modulating transcription, splicing, or translation of an 
mRNA encoding a polypeptide of the invention. In yet 
another embodiment, the agent is a nucleic acid molecule 
having a nucleotide sequence that is antisense zc the 
coding strand of an mRNA encoding a polypeptide of the 

2 5 invention. 

The present invention also provides methods for 
treating a subject having a disorder characterized by 
aberrant activity of a polypeptide of the invention or 
aberrant expression of a nucleic acid or polypeptide of 

3 0 the invention by administering an agent which is a 

modulator of the activity of a polypeptide of tne 
invention or a modulator of the expression of a nucleic 
acid or polypeptide of the invention to the subject. In 
one embodiment, the modulator is a protein of the 
3 5 invent lcr. . In another embodiment, the modulator is a 
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Detailed Description of the Invention 
The present invention is based on the discovery of 
cDNA molecules encoding TANGO 191 and TANGO 195, both of 
which are secreted proteins. The subsections and Tables 
5 summarize certain features of TANGO 191 and TANGO 195. 



TANGO 191 

The human TANGO 191 cDNA of SEQ ID NO:l has a 1797 
nucleotide open reading frame (SEQ ID NO: 3) encoding a 
599 amino acid protein (SEQ ID NO:2) . The cDNA and 

10 protein sequences of human TANGO 191 are shown in Figure 
1. This cDNA was isolated from a human mixed lymphocyte 
reaction library based on its sequence similarity to 
genes encoding certain members of the interleukin- 1 (IL- 
1) receptor superfamily. 

15 Human TANGO 191 is a transmembrane protein having 

' a 19 amino acid signal sequence (amino acids 1 - 19 of 
SEQ ID NO: 2; SEQ ID NO: 7) followed by a 58 0 amino acid 
mature protein (amino acids 20 - 599 of SEQ ID NO: 2; SEQ 
ID NO: 8) . Mature TANGO 191 is predicted to have a 

2 0 transmembrane domain that extends from amino acid 3 58 to 
amino acid 382 of SEQ ID NO:2 (SEQ ID NO:10). an 
extracellular domain that extends from amino acid 20 to 
amino acid 357 of SEQ ID NO:2 (SEQ ID NO:9), and a 
cytoplamic domain extending from amino acid 3 83 to amino 

25 acid 599 of SEQ ID NO:2 (SEQ ID NO: 11) . 

TANGO 191 has a molecular weight of 68.3 kDa prior 
to cleavage of its signal peptide and a molecular weight 
of 66.1 kDa after cleavage of its signal peptide. 

TANGO 191 has four potential N-glycosylation sites 

30 (amino acids 21-24, 119-122, 152-155, and 345-248 of SEQ 
ID NO:2); 15 potential protein kinase C phosphorylation 
sites (ammo acids 26-28, 35-37, 63-65, 160-162, 203-205, 
233-235, 272-275, 307- 309, 311-313, 327-329, 474-476, 
506-508, 538-540, 575-577, and 590-592 of SEQ ID NO : 2 ) ; 
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Figure 2 is a hydropathy plot of TANGO 191. 
Relative hydrophobicity is shown above the line marked 
"0" , and relative hydrophilicity is shown below the line 
marked "C". 

5 Figure 3 depicts the cDNA sequence (SEQ ID N0:4) 

and predicted amino acid sequence (SEQ ID NO: 5) of a 
partial human TANGO 195 clone. The open reading frame of 
SEQ ID NO: 4 extends from nucleotide 166 to 1101, 
inclusive (SEQ ID NO: 6) . 
10 Figure 4 is a hydropathy plot of TANGO 195. 

Relative hydrophobicity is shown above the line marked 
rl 0", and relative hydrophilicity is shown below the line 
marked "0". 

Figure 5 depicts an alignment of the amino acid 
15 sequences of human TANGO 195 (SEQ ID NO: 5) and human SLAK 
(SEQ ID NO: 20) . In this alignment the sequences are 
22.8% identical overall. 

Figure 6 depicts an alignment of portions of TANGO 
191 with PF00047, an IG superfamily domain HMM (SEQ ID 
20 NOs 121, 22, and 23) . 

Figure 7 depicts the cDNA sequence (SEQ ID NO: ) 

and predicted amino acid sequence (SEQ ID NO: ; of 

murine TANGO 195. The open reading frame of SEQ ID 

NO: extends from nucleotide 42 to 876, inclusive (SEQ 

25 ID NO: ) . 

Figure 8 depicts the cDNA sequence (SEQ ID NO: 

and predicted amino acid sequence (SEQ ID NO: ) of a 

partial human TANGO 195 clone (T195Athpb93f 1 ) . The open 
reading frame extends from nucleotide 159 to 1118, 

30 inclusive (SEQ ID NO: ). 

Figure 9 depicts the cDNA sequence (SEQ. ID NO: 

and predicted ammo acid sequence (SEQ ID NO: ; of a 

full length TANGO 195 clone (T195AthLal70f 10) . The open 
reading frame extends from nucleotide 25 to nucleotide 
35 879, inclusive (SEQ ID NO: ) . 
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which engages a protein complex that acts to activate NK- 
yB. Members of the NK-yB family regulate many of immune 
and inflammatory genes that are induced by IL-1. 

Since TANGO 191 has some similarity to IL-1 
5 receptor, TANGO 191 nucleic acids and polypeptides as 
well as modulators of TANGO 191 expression or activity 
are useful in the treatment of a variety of immune and 
inflammatory disorders, e.g., asthma, graft vs-host 
disease, rheumatoid arthritis, psoriasis, inflammatory 
10 bowel disease, septic shock, ulcerative colitis, Crohn's 
disease, chronic myelogenous leukemia, cancer, liver 
disease, Hodgkin's disease, osteoarthritis, Lyme disease, 
cachexia, and autoimmune diseases, e.g., myasthenia 
gravis, autoimmune diabetes, and lupus. 

15 TANGO 195 

The human TANGO 195 partial cDNA of SEQ ID NO: 4 
has a 93 6 nucleotide open reading frame (SEQ ID NO: 6) 
encoding a 312 amino acid protein (SEQ ID NO: 5) . The 
cDNA and protein sequences of human TANGO 195 clone are 

20 shown in Figure 3. This partial TANGO 195 cDNA clone was 
isolated from a human mixed lymphocyte reaction library 
based on its homology to signalling lymphocyte activation 
marker (Cocks et al . (1995) Nature 376:260-63). Apoarenz 
full-length clones (3.0 kb and 1.3 kb) were isolated from 

2 5 the same library and a human mid- term placental library. 

The portion of human TANGO 195 encoded by the cDNA 
of SEQ ID NO: 4 is a transmembrane secreted protein having 
a 22 amino acid signal sequence (amino acids 1 - 22 of 
SEQ ID NO: 5; SEQ ID NO: 12} followed by a 2 91 ammo acid 

30 mature protein (amino acids 23 - 312 of SEQ ID NO: 5; SEQ 
ID NO: 13 ; . The portion of mature TANGO 195 encoded by 
the cDNA of SEQ ID NO : 4 is predicted to have a 
transmembrane domain that extends from amino acid 234 to 
amine acid 254 of SEQ ID NO:5 (SEQ ID NO:15), an 
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12 potential casein kinase II phosphorylation sites 
(amino acids 36-39, 89-92, 133-136, 224-227, 294-297, 
301-304, 311-314, 327-330, 401-404, 427-430, 490-493, and 
585-588 of SEQ ID NO ; 2 ) ; one potential tyrosine kinase 

> phosphorylation site (amino acids 205-212 of SEQ ID 
NO:2); and six potential N-myristoylation sites (amino 
acids 117-122, 168-173, 217-222, 366-371, 460-465, and 
583-588 of SEQ ID NO:2) . 

Figure 2 is a hydropathy plot of TANGO 191. 

) Relative hydrophobicity is shown above the line marked 
"0", and relative hydrophilicity is shown below the line 
marked " 0 " . 

Northern analysis of human TANGO 191 mRNA 
expression revealed that it is expressed in spleen, lymph 

5 node, peripheral blood lymphocytes, and bone marrow. 

A clone (EPftX191a) containing a cDNA encoding 
TANGO 191 inserted into pZL-1 (GIBCO/BRL; Bethesda, MD) 
between the NotI and Sail sites was deposited with the 
American Type Culture Collection, Manassas, VA on 

) September 9, 1998, and assigned Accession Number . 

Human TANGO 191 appears to be a member of the IL-1 
receptor superfamily. TANGO 191 includes three regions 
(amino acids 71-123 of SEQ ID NO:2; SEQ ID NO:17); amino 
aicds 166-223 of SEQ ID NO : 2 ; SEQ ID NO:18); amino acids 

5 266-339 of SEQ ID NO;2; SEQ ID NO:19) which have homology 
to the IG superfamily domain (PF00047) that is 
characteristic of members of the IL-1 superfamily (Figure 
6) . 

IL-1 receptor (IL-1R) plays a critical role the 
0 regulation of immune and inflammatory responses. 

Signalling by IL-1R requires that IL-1R form a complex 
with IL-lAcP, a protein which may be required for 
internalization of IL-1R. It is thought that both IL-1R 
and IL-lAcP interact with IRAK- 2. It has been proposed 
5 that this multiprotein complex interacts with TRAF6 , 
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full length TANGO 195 clone (T195AthLal70f 10 ) . The open 
reading frame extends from nucleotide 2 5 to nucleotide 

879, inclusive (SEQ ID NO: ). 

The full-length TANGO 195 protein of Figure 9 is 
5 predicted to be a transmembrane protein having a 22 amino 
acid signal sequence (amino acids 1-22 of SEQ ID NO: 

SEQ ID NO: ) followed by a 263 amino acid mature 

protein (amino acids 23-285 of SEQ ID NO: ) . This form 

of TANGO 195 is predicted to have a transmembrane domain 

10 extending from amino acid 234 to amino acid 254 of SEQ ID 

N0: < SE Q ID NO: ), an extracellular domain extending 

from amino acid 23 to amino acid 23 3 of SEQ IQ NO: 

(SEQ ID NO: ) and a cytoplasmic domain that extends 

from amino acid 255 to amino acid 285 of SEQ ID NO: 

15 (SEQ ID NO: ) . 

In situ expression analysis of TANGO 195 in adult 
mice revealed expression in the spleen (mutlifocal 
expression with expression highest in follicles) , thymus 
(multifocal expression) , and lymph node (multifocal 

20 expression) . No expression was detected in lung and 
stomach. In situ expression analysis was also used to 
examine expression in the spleens of adult mice 1, 3, 5, 
and 14 pose -immunization with EFA/PBS. In each case 
multifocal expression was observed with expression being 

25 highest in the follicles. The expression at 14 days 
post - immunization was somewhat lower than in at other 
time points. 

Northern analysis of TANGO 195 expression revealed 
the presence of a 1.8 kb transcript and a 3.4 kb 

3 0 transcript in spleen, lymph node and thymus and a 1.8 kb 
transcript was observed in bone marrow with expression 
being highest in lymph node. Additional Northern 
analysis revealed expression in the following tissues (in 
decreasing order of expression) : lymph node, stomach, 

35 small intestine, appendix, lung, spleen, and bone marrow. 
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extracellular domain that extends from amino acid 23 to 
ammo acid 233 of SEQ ID NO:5 (SEQ ID NC:14) / and a 
cytoplamic domain extending from amino acid 255 to amino 
acid 312 of SEQ ID NO : 5 (SEQ ID NO: 16) . 
5 TANGO 195 is a type I transmembrane protein 

belonging to the CD2 subgroup of the immunoglobulin 
superf amily . 

The TANGO 195 of SEQ ID NO : 5 has three potential 
N-glycosylation sites (amino acids 85-83, 100-103, and 

10 156-159 of SEQ ID NO : 5 ) ; three potential protein kinase C 
phosphorylation sites (amino acids 163-165, 230-232, and 
308-310 cf SEQ ID NO:5); three potential casein kinase II 
phosphorylation sites (amino acids 168-171, 215-218, and 
230-233 cf SEQ ID NO: 5) ; one potential tyrosine kinase 

15 phosphorylation site (amino acids 65-72 of SEQ ID NO: 5) ; 
one potential cGMP- dependent protein kinase 
phosphorylation site (amino acids 165-168 of SEQ ID 
NO:5;; and three potential N-myristoylation sites (amino 
acids 66-71, 110-115, and 183-188 of SEQ ID NO:5). 

20 Figure 4 is a hydropathy plot of the TANGO 195 of 

SEQ ID NO: 5. Relative hydrophobic ity is shown above the 
line marked "0", and relative hydrophilicity is shown 
belov; the line marked "0". 

Figure 7 depicts the cDNA sequence (SEQ ID NO: ■ 

25 and predicted amino acid sequence (SEQ ID NO: ) of 

murine TANGO 195. The open reading frame of SEQ ID 

NO: extends from nucleotide 42 to 876, inclusive (SEQ 

ID NO: ) . 

Figure 8 depicts the cDNA sequence (SEQ ID NO: ) 

30 and predicted amino acid sequence (SEQ ID NO: ) of a 

partial human TANGO 195 clone (T195Athpb93f 1 ) . The open 
reading frame° extends from nucleotide 159 to 111S, 

inclusive (SEQ ID NO: ) . 

Figure 9 depicts the cDNA sequence (SEQ ID NO: < 

35 and predicted amino acid sequence (SEQ ID NO: ) of a 
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hours) . The strongest expression was seen in monocytes 
activated with gIFN (0.2 or 2ug/ml) giving 2 bands of 
approximately 1.5-2 kb and 3-3.5 kb (with the upper band 
stronger) . Addition of LPS did not increase expression 
5 in monocytes or PBMCs and may in fact lead to a slight 
decrease in expression in gIFN stimulated monocytes. 

TANGO 195 function was investigated by 
reconstituting irradiated mice with bone marrow cells 
infected with retrovirus expressing either full length 

10 murine TANG0195 ( ,T T195fl M ) or the extracellular domain of 
murine TANGO 195 ( "T195ex" ) . The donor and recipient 
mice were C57BL/6 and congenic for CD45 (CD45.1 for 
donor, CD45.2 for recipient). Recipient mice were 
analyzed approximately 10 and 14 weeks after 

15 transplantation for blood chemistry, hematology and 

tissue histology. Peripheral blood, spleen, lymph node 
and thymus cells were analyzed by FACs analysis and TANGO 
195 RNA levels were analyzed in spleen. The percentage of 
infected cells, based on the percentage of G418-resistant 

20 donor cells, was 54% (T195fl) and 69% (T195ex) . The level 
of TANGO 195 RNA expression in the spleen of recipient 
mice, based upon slot blot: analysis (GAPDH control), was 
10 cimes that of control mice (328% of GAPDH for T195ex 
and 232% of GAPDH for T195fl compared to 33% of GAPDH in 

25 control mice) . 

Expression of T195ex led to a statistically 
significant increase in triglyceride levels (ave = 97.6 
mg/dl at 12 weeks; ave = 94.3 mg/dl at 16 weeks) in serum 
as compared to control mice (ave = 52.5 mg/dl at 12 week; 

30 ave = 63.1 mg/dl 16 weeks) or mice expressing T195fl (ave 
= 4 7.9 mg/dl at 12 weeks; ave = 67.5 mg/dl at 16 weeks) . 



Expression of T195fl had a variety of effects on 
lymphocytes. In T195fl expressing mice, FACs analysis cf 
peripheral blood showed an increase in total B220hi 
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However, a probe based on the open reading frame did not 
detect: expression in lung or stomach. 

Additional Northern analysis revealed that TANGO 
195 is expressed in activated human monocytes/macrophages 
5 and, at lower level, in activated human lymphocytes. 
This analysis also revealed that cytokine induced 
differentiation of T cells appears to regulate TANGO 195 
expression. To carry out this analysis, PBMCs were 
purified from human buffy coat by ficol gradient 

10 ce-ntrifugation as per manuf acturers instructions (Sigma) . 
PBMCs were activated for 2 4 hours with lug/ml LPS . Using 
Miltenyi Biotech positive selection beads, CD4+, CD8+ and 
CD19+ cells were isolated from resting PBMCs. The CD8+ 
cells were activated for 24 hours using plate bound anti- 

15 CD3 . Resting monocytes were purified from PBMCs by 

gradient centrif ugation and stimulated with 0, 0.01, 0.1 
or lug/ml LPS either with or without 2ng/ml gIFN for 4 
hours or with 0, 0.2 or 2 ng/ml gIFN for 4 hours. 
Activated T cells (predominantly CD4 + cells) were 

20 prepared from PBMCs by stimulation with plate bound anti- 
CD3 (TR66) for 3 days in RPMI 10% FBS . Cells were then 
expanded without anti-CD3 but in the presence of IL2 for 
22 days. Cells were resuspended at 107 /mi in fresh RPMI 
10% FBS with no exogenous cytokines, or IL10 (20ng/ml) 

25 plus IL4 (40ng/ml) or TNFa (5,000u/ml) plus gIFN 

(15ng/ml) . (All cytokines were purchased from Genzyme.) 
Cells were harvested after 8 and 24 hours. RNA was 
prepared from all cell types using RNeasy mini kit 
(Qiagen) and expression was analyzed by standard Northern 

30 analysis using approx lOug total RNA. No expression was 
seen in resting leukocytes PBMCs , CD4+, CD8+, CD19 + or 
monocytes. A single band of approximately 3-3.5 kb was 
seen in activated CD8+ cells and in T cells activated 
with no exogenous cytokines or the combination of IL10 

35 and IL4, but not with TNFa and gIFN (after both 3 and 24 
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These results also suggest thar TANGO 195 may pla^ 
a role in B cell leukemia, immune response, and 
autoimmune disorders (e.g., arthritis). 

TANGO 195 maps to human chromosome locus hulq21. 
5 The flanking markers are AFMA323ZE5 and D1S263 5. The 
among identified loci in close proximity to TANGO 195 are 
HYPLP1 (hyperlipidemial) and LPD1 (lipodystrophy) . 
Nearby known human genes include: SPTA1 (spectrin, 
alpha), THBS3 ( thrombospondin 3), MTX (metaxin) , CTSS 

10 (cathepsin K, S) , FLG (f ilaggrin) , PKLR (pyruvate kinase) , 
HYPLIP1 (hyperlipidemia) . 

The mouse chromosome corresponding to the human 
chromosomal locus is chromosome 3. Nearby mouse loci 
include: soc (soft coat), hyplipl (hyperlipidemia), ft 

15 (flaky tail) and ma (matted) . Nearby mapped mouse genes 
include: Imna (lamin A), fig (f ilaggrin) , bean 
(brevican) , gba (acid beta glucosidase) . 

Rabbit polyclonal antibodies were raised against 
three peptides from murine TANGO 195 peptides to amino 

20 acids 26-34, 102-117 and 161-176. Peptide purified sera 
from rabbits immunized with amino acids 102-117 
specifically recognizes mouse T195-hFc protein by 
scandarad Western Blot. Additionally unpurified sera 
from rabbits immunized with amino acids 102-117 recognize 

25 mouse T195-hFc by ELISA. ELISA plates were coated with 5 
Tg/ml mouse T195-hFc or human lg control in PBS overnight 
at 4°C. Plates were washed and blocked with PBS 1% PSA. 
Serial dilutions of serum were added and incubated for 
approximately 2 hours at room temperature. Plates were 

30 washed and rabbit immunoglobulin detected with anti- 
rabbit iq-HRP. Serum from rabbits immunized with the 
peptide corresponding to amino acids 102-117 showed 
greater than 20 fold higher titres against mouse T195-hFc 
compared to human lg, and showed greater than 20 fold 
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(CD45Rhi) , IgD+ B cells compared to control mice or mice 
expressing T195ex. The B220hi (CD45Rhi) , IgD+ B cells 
showed low levels of expression of Macl which is 
generally regarded as a marker for cells of the 
monocyte /macrophage lineage. FACs analysis of peripheral 
blood showed a slight decrease in CD4 + T cells in T195fl 
expressing mice compared to control mice and T195ex 
expressing mice. FACs analysis of spleen and lymph node 
cells showed a similar increase in B220+ IgD+ Macllo 
cells. These cells had similar levels of surface IgD and 
IgM compared to B220 + cells from control and T195ex mice. 
CD45.1 staining confirmed that the B220-r IgD+ Macllo 
cells in spleen, lymph nodes and peripheral blood were 
derived from donor bone marrow. 

In T195fl expressing mice, FACs analysis of 
oeritoneal lavage cells showed an increase in total B220- 
cells, an increase in B220+ CD231o cells, and an increase 
in B220+ Macllo cells compared to control mice of T195ex 
expressing mice. B220+ cells showed slightly lower 
0 levels of IgD compared to B220+ cells from control mice 
or T195ex expressing mice. In addition, B220+ CD23lo 
cells in T195fl expressing mice showed significantly 
lower expression of B220 compared to control and T195ex 
mice. Finally, it was observed that very few CD5^ B220- 
5 were seen on peritoneal lavage from T195fl, T195ecd or 
control mice. 

Taken together, these results suggest that 
increased expression of T195fl in bone marrow derived 
cells leads to an increase in Bl like cells in the 
0 periphery. Since Bl cells are not normally bone marrow 
derived the exact lineage of the cells is difficult to 
determine however they appear to be Bib cells. These 
results also suggest that increased expression of .T195f 1 
on bone marrow derived cells may lead to a decrease in 
5 CD4+ T ceils in the periphery. 
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TANGO 195 expression or activity may be useful in the 
treatment of diorders associated with abberrant B cell 
expansion or differentiation or abberrant cytokine 
production, e.g., allergic and autoimmune disorders. 

5 TABLE 1: Summary of Human TANGO 191 and TANGO 195 



Sequence Information 



Gene 


cDNA 


ORF 


Protein 


Figure 


Accession 
No. 


TANGO 19- 


SEQ ID NO:l 


SEQ ID NO: 3 


SEQ ID NO: 2 


Fig. 1 




TANGO 195 
(partial) 


SEQ ID NO: 4 


SEQ ID NO: 6 


SEQ ID NO: 5 


Fig. 2 




TANGO 195 
(partial > 


SEQ ID NO: 


SEQ ID NO: 


SEQ ID NO: 


Fig. 8 




TANGO 19 5 
full-length 


SEQ ID NO: 


SEQ ID NO: 


SEQ ID NO: 


Fig. 9 





15 TABLE 2: Summary of Domains of TANGO 191 and TANGO 195 





Protein 


Signal 


Mature 


Excraceliula 


Transmemorane 


Cvtcciasmic 






Sequence 


Protein 


r- 


Domain 


Dotna i n 










Domain 








TANGC 191 


aa 1-19 


aa 20-599 


aa 20-357 


aa 358-382 


aa je3-59S 




SEQ ID NO: 2 


SEQ ID NO: 7 


SEQ ID NO: 8 


SEQ ID NO: 9 


SEQ ID NO: 10 


SEQ ID NO: 11 




TANGO 19 5 


aa 1-22 


aa 23-312 


aa 20-233 


aa 234-254 


aa 255-312 


20 


SEQ ID NO: 5 


SEQ ID 


SEQ ID NO: 13 


SEQ ID NO: 14 


SEQ ID NO: 15 


SEQ ID NO: 16 






NO: 12 











Various aspects of the invention are described in 
further detail in the following subsections 
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higher titres against mouse T195-hFc compared to control 
serum. 

Several TANGO 195 /immunoglobulin constant region 
fusion proteins were created. Using human TANGO 195 a 
5 fusion protein consisting of TANGO 195 (aal-233)- 
AAPGGASYKD- human IgGlfc was created. A second human 
TANGO 195 fusion substituted murine IgGlfc for human 
IgGlfc. Using murine TANGO 195 a fusion protein 
consisting of TANGO 195 (aal to 231 ) -AASGKASYKD -human 

10 IgGlfc was created. A second murine TANGO 195 fusion 
protein substituted murine IgGlfc for human IgGlfc. 

A clone (EpjthPb093fol) containing a 1.3- kb cDNA 
encoding apparently full-length TANGO 19 5 in pMET7 
between Not I and Sail was deposited with the American 

15 Type Culture Collection, Manassas , VA on September 9, 

1998, and assigned Accession Number . 

TANGO 195 has regions that are significantly 
similar to human signalling lymphocyte activation 
molecule ("SLAM") (Accession Number U33017) . For 

20 example, the region of TANGO 195 from amino acid 173 to 
amino acid 250 has 32% identity (25/73 amino aicds) and 
50% identity (39/59 amino acids) to the corresponding 
region of SLAM; the region of TANGO 195 from amino acid 
134 to amino acid 164 has 32% identity (10/31 amino 

25 acids) and 41% identity (13/31 amino acids) to the 

corresponding region of SLAM; and the region of TANGO 195 
from amino acid 117 to amino acid 132 has 43% identity 
(7/16 amino acids) and 75% identity (12/16 amino acids) 
to the corresponding region of SLAM (Figure 5) . 

30 SLAM is thought to enhance the expansion and 

differentiation of activated B cells (Punnonen et al . 

(1997) J. Exp. Med. 185:993-1004) and in the regulation 
of cype 1 and type 2 cytokine production (Ferrante et al . 

(1998) C . Immunology 160:1514-21). TANGO 195 likely has 
35 a function similar to that of SLAM. Thus, modulators of 
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A nucleic acid molecule of the present invention, 
e.g., a nucleic acid molecule having the nucleotide 
sequence of SEQ ID NO : 1 , 3, 4, or 6, the cDNA of ATCC 

, or the cDNA of ATCC , or a complement 

thereof; can be isolated using standard molecular biology 
techniques and the sequence information provided herein. 
Using all or a portion of the nucleic acid sequences of 

SEQ ID NO: 1, 3, 4, or 6, the cDNA of ATCC , or the 

cDNA of ATCC as a hybridization probe, nucleic 

acid molecules of the invention can be isolated using 
standard hybridization and cloning techniques (e.g., as 
described in Sambrook et al . , eds . , Molecular Cloning: A 
Laboratory Manual, 2nd ed., Cold Spring Harbor 
Laboratory, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY, 1989) . 

A nucleic acid molecule of the invention can be 
amplified using cDNA, mRNA or genomic DNA as a template 
and appropriate oligonucleotide primers according to 
standard PCR amplification techniques. The nucleic acid 
so amplified can be cloned into an appropriate vector and 
characterized by DNA sequence analysis. Furthermore, 
oligonucleotides corresponding to all or a portion of a 
nucleic acid molecule of the invention can be prepared by 
standard synthetic techniques, e.g., using an aucomated 
DNA synthesizer. 

In another preferred embodiment, an isolated nucleic 
acid molecule of the invention comprises a nucleic acid 
molecule which is a complement of the nucleotide sequence 

of SEQ ID NO:l, 3, 4, or 6 , the cDNA of ATCC , or 

the cDNA of ATCC , or a portion thereof. A nucleic 

acid molecule which is complementary to a given 
nucleotide sequence is one which is sufficiently 
complementary to the given nucleotide sequence that it 
can hybridize to the given nucleotide sequence thereby 
forming a stable duplex. 
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I. Isolated Nucleic Acid Molecules 

One aspect of the invention pertains to isolated 
nucleic acid molecules that encode a polypeptide of the 
invention or a biologically active portion thereof, as 
5 well as nucleic acid molecules sufficient for use as 
hybridization probes to identify nucleic acid molecules 
encoding a polypeptide of the invention and fragments of 
such nucleic acid molecules suitable for use as PCR 
primers for the amplification or mutation of nucleic acid 

10 molecules. As used herein, the term "nucleic acid 
molecule" is intended to include DNA molecules (e.g., 
cDNA or genomic DNA) and RNA molecules (e.g., mRNA) and 
analogs of the DNA or RNA generated using nucleotide 
analogs. The nucleic acid molecule can be single- 

15 stranded or double -stranded, but preferably is double- 
stranded DNA. 

An "isolated" nucleic acid molecule is one which is 
separated from other nucleic acid molecules which are 
present in the natural source of the nucleic acid 

20 molecule. Preferably, an "isolated" nucleic acid 
molecule is free of sequences (preferably protein 
encoding sequences) which naturally flank the nucleic 
acid (i.e., sequences located at the 5' and 3' ends of 
the nucleic acid) in the genomic DNA of the organism from 

2 5 which the nucleic acid is derived. For example, in 

various embodiments, the isolated nucleic acid molecule 
can contain less than about 5kB, 4 kB, 3 kB, 2 kB, lkB, 
0.5 kB or 0.1 kB of nucleotide sequences which naturally 
flank the nucleic acid molecule in genomic DNA of the 

30 cell from which the nucleic acid is derived. Moreover, 
an "isolated" nucleic acid molecule, such as a cDNA 
molecule, can be substantially free of other cellular 
material, or culture medium when produced by recombinant 
techniques, or substantially free of chemical precursors 

35 or other chemicals when chemically synthesized. 
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A nucleic acid fragment encoding a "biologically 
active portion" of a polypeptide of the invention can be 
prepared by isolating a portion of any of SEQ ID NO: 3 or 

6, the nucleotide sequence of the cDNA of ATCC , or 

the nucleotide sequence of the cDNA of ATCC which 

encodes a polypeptide having a biological activity, 
expressing the encoded portion of the polypeptide protein 
(e.g., by recombinant expression in vitro) and assessing 
the activity of the encoded portion of the polypeptide. 

The invention further encompasses nucleic acid 
molecules that differ from the nucleotide sequence of SEQ 

ID NO:l, 3, 4, or 6 , the cDNA of ATCC , or the cDNA 

of ATCC due to degeneracy of the genetic code and 

thus encode the same protein as that encoded by the 
nucleotide sequence of SEQ ID NO : 1 , 3, 4, or 6 ( the cDNA 

of ATCC , or the cDNA of ATCC . 

In addition to the nucleotide sequences shown in SEQ 

ID NO: 3 and 6 and present in the cDNA of ATCC and 

the cDNA of ATCC , it will be appreciated by those 

skilled in the art that DNA sequence polymorphisms that 
lead to changes in the amino acid sequence may exist 
within a population (e.g., the human population) . Such 
genetic polymorphisms may exist among individuals within 
a population due to natural allelic variation. An allele 
is one of a group of genes which occur alternatively at a 
given genetic locus. As used herein, the phrase "allelic 
variant" refers to a nucleotide sequence which occurs at 
a given locus or to a polypeptide encoded by the 
nucleotide sequence. As used herein, the terms "gene" 
and "recombinant gene" refer to nucleic acid molecules 
comprising an open reading frame encoding a polypeptide 
of the invention. Such natural allelic variations can 
typically result in 1-5% variance in the nucleotide 
sequence of a given gene. Alternative alleles can be 
identified by sequencing the gene of interest in a number 



WO 00/18800 



PCT7US99/22818 



23 



Moreover, a nucleic acid molecule of the invention 
can comprise only a portion of a nucleic acid sequence 
encoding a full length polypeptide of the invention for 
example, a fragment which can be used as a probe or 
5 primer or a fragment encoding a biologically active 

portion of a polypeptide of the invention. The nucleotide 
sequence determined from the cloning one gene allows for 
the generation of probes and primers designed for use in 
identifying and/or cloning homologues in other cell 
10 types, e.g., from other tissues, as well as homologues 

from other mammals. The probe/primer typically comprises 
substantially purified oligonucleotide. The 
oligonucleotide typically comprises a region of 
nucleotide sequence that hybridizes under stringent 
15 conditions to at least about 12, preferably about 25, 
more preferably about 50, 75, 100, 125, 150, 175, 200, 
250, 300, 350 or 400 consecutive nucleotides of the sense 
or anti- sense strand of SEQ ID NO:l, 3, 4, or 6 , the cDNA 

ATCC f ox the cDNA of ATCC or of a naturally 

20 occurring mutant of SEQ NO: 1, 3, 4, or 6, the cDNA of 

ATCC , or the cDNA of ATCC . 

Probes based on the sequence of a nucleic acid 
molecule of the invention can be used co detect 
transcripts or genomic sequences encoding the same 
25 protein molecule encoded by a selected nucleic acid 
molecule. The probe comprises a label group attached 
thereto, e.g., a radioisotope, a fluorescent compound, an 
enzyme, or an enzyme co- factor. Such probes can be used 
as part of a diagnostic test kit for identifying cells or 
3 0 tissues which mis-express the protein, such as by 

measuring levels of a nucleic acid molecule encoding the 
protein in a sample of cells from a subject, e.g., 
detecting mRNA levels or decermining whether a gene 
encoding the protein has been mutated or deleted. 
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As used herein, the term "hybridizes under stringent 
conditions" is intended to describe conditions for 
hybridization and washing under which nucleotide 
sequences at least 60% (65%, 70%, preferably 75%) 
5 identical to each other typically remain hybridized to 
each other. Such stringent conditions are known to those 
skilled in the art and can be found in Current Protocols 
in Molecular Biology, John Wiley & Sons, N.Y. (1989), 
6.3.1-6.3.6. A preferred, non-limiting example of 

10 stringent hybridization conditions are hybridization in 
6X sodium chloride/sodium citrate (SSC) at about 45°C, 
followed by one or more washes in 0.2 X SSC, 0.1% SDS at 
50-65°C. Preferably, an isolated nucleic acid molecule 
of the invention that hybridizes under stringent 

15 conditions to the sequence of SEQ ID N0:1, 3, 4 , or 6, 

the cDNA of ATCC , or the cDNA of ATCC , or the 

complement thereof, corresponds to a naturally-occurring 
nucleic acid molecule. As used herein, a "naturally- 
occurring 11 nucleic acid molecule refers to an RNA or DNA 

20 molecule having a nucleotide sequence that occurs in 
nature (e.g., encodes a natural protein). 

In addition to naturally-occurring allelic variants 
of a nucleic acid molecule of the invention sequence that 
may exist in the population, the skilled artisan will 

2 5 further appreciate that changes can be introduced by 
mutation thereby leading to changes in the amino acid 
sequence of the encoded protein, without altering the 
biological activity of the protein. For example, one can 
make nucleotide substitutions leading to amino acid 

30 substitutions at "non-essential" amino acid residues. A 
"non-essential" amino acid residue is a residue that can 
be altered from the wild-type sequence without altering 
the biological activity, whereas an "essential" amino 
acid residue is required for biological activity. For 

35 example, amino acid residues that are not conserved or 
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of different individuals. This can be readily carried 
out by using hybridization probes to identify the same 
genetic locus m a variety of individuals. Any and all 
such nucleotide Variations and resulting amino acid 
5 polymorphisms or variations that are the result of 
natural allelic variation and that do not alter the 
functional activity are intended to be within the scope 
of the invention. 

Moreover, nucleic acid molecules encoding proteins 

10 of- the invention from other species (homologues) , which 
have a nucleotide sequence which differs from that of the 
protein described herein are intended to be within the 
scope of the invention. Nucleic acid molecules 
corresponding to natural allelic variants and homologues 

15 of a cDNA of the invention can be isolated based on their 
identity to the nucleic acid molecule disclosed herein 
using a cDNA , or a portion thereof, as a hybridization 
probe according to standard hybridization techniques 
under stringent hybridization conditions. For example, a 

20 cDNA encoding a soluble form of a membrane -bound protein 
of the invention isolated based on its hybridization to a 
nucleic acid molecule encoding all or part of the 
membrane -bound form. Likewise, a cDNA encoding a 
membrane -bound form can be isolated based on its 

25 hybridization to a nucleic acid molecule encoding all or 
part of the soluble form. 

Accordingly, in another embodiment, an isolated 
nucleic acid molecule of the invention is at least 300 
(325, 350, 375, 400, 425, 450, 500, 550, 600, 650, 700, 

30 800, 900, 1000, or 1290) nucleotides in length and 

hybridizes under stringent conditions to the nucleic acid 
molecule comprising the nucleotide sequence, preferably 
the coding sequence, of SEQ ID N0:1, 3, 4, or 6 , the cDNA 
of ATCC , the cDNA of ATCC , or a complement 

3 5 thereof. 
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acids with basic side chains (e.g., lysine, arginine, 
histidine), acidic side chains (e.g., aspartic acid, 
glutamic acid), uncharged polar side chains (e.g., 
glycine, asparagine, glutamine, serine, threonine, 
5 tyrosine, cysteine), nonpolar side chains (e.g., alanine, 
valine, leucine, isoleucine, proline, phenylalanine, 
methionine, tryptophan), beta-branched side chains (e.g., 
threonine, valine, isoleucine) and aromatic side chains 
(e.g., tyrosine, phenylalanine, tryptophan, histidine). 
10 Alternatively, mutations can be introduced randomly along 
all or part of the coding sequence, such as by 
saturation mutagenesis, and the resultant mutants can be 
screened for biological activity to identify mutants that 
retain activity. Following mutagenesis, the encoded 
15 protein can be expressed recombinantly and the activity 
of the protein can be determined. 

In a preferred embodiment, a mutant polypeptide that 
is a variant of a polypeptide of the invention can be 
assayed for: (1) the ability to form protein:protein 
20 interactions with proteins in a signalling pathway of the 
polypeptide of the invention; (2) the ability to bind a 
ligand of the polypeptide of the invention; or (3) the 
ability to bind to an intracellular target protein of the 
polypeptide of the invention. In yet another preferred 
2 5 embodiment, the mutant polypeptide can be assayed for the 
ability to modulate cellular proliferation or cellular 
differentiation. 

The present invention encompasses antisense nucleic 
acid molecules, i.e., molecules which are complementary 
30 to a sense nucleic acid encoding a polypeptide of the 

invention, e.g., complementary to the coding strand of a 
double -stranded cDNA molecule or complementary to an mRNA 
sequence. Accordingly, an antisense nucleic acid can 
hydrogen bond to a sense nucleic acid. The antisense 
35 nucleic acid can be complementary to an entire coding 
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only semi -conserved among homologues of various species 
may be non-essential for activity and thus would be 
likely targets for alteration. Alternatively, ammo acid 
residues that are conserved among the homologues of 
5 various species (e.g., murine and human) may be essential 
for activity and thus would not be likely targets for 
alteration. 

Accordingly, another aspect of the invention 
pertains to nucleic acid molecules encoding a polypeptide 

10 of the invention that contain changes in amino acid 
residues that are not essential for activity. Such 
polypeptides differ in amino acid sequence from SEQ ID 
NO: 2, 5, 8, and 13 yet retain biological activity. In 
one embodiment, the isolated nucleic acid molecule 

15 includes a nucleotide sequence encoding a protein that 
includes an amino acid sequence that is at least about 
45% identical, 65%, 75%, 85%, 95%, or 98% identical to 
the amino acid sequence of any of SEQ ID NO; 2, 5, 8, or 
13 . 

20 An isolated nucleic acid molecule encoding a variant 
protein can be created by introducing one or more 
nucleotide substitutions, additions or deletions into the 
nucleotide sequence of SEQ ID NO:l, 3, 4, or 6, the cDNA 
of A ?CC , or the cDNA of ATCC such that one 

2 5 or more amino acid substitutions, additions or deletions 

are introduced into the encoded protein. Mutations can 
be introduced by standard techniques, such as site- 
directed mutagenesis and PCR-mediated mutagenesis. 
Preferably, conservative amino acid substitutions are 

3 0 made at one or more predicted non-essential amino acid 

residues. A "conservative amino acid substitution" is 
one m which the amino acid residue is replaced with an 
amino acid residue having a similar side chain. Families 
of amino acid residues having similar side chains have 
35 been defined in the art. These families include amino 
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methoxycarboxymethyluracil , 5 -methoxyuracil , 2- 
methyl thio-N6 - isopentenyladenine , uracil - 5 -oxyacetic acid 
(v) , wybutoxosine , pseudouracil , queosine, 2- 
thiocytosme, 5-methyl-2-thiouracil , 2 -thiouracil , 4- 
thiouracil, 5-methyluracil , uracil-5-oxyacetic acid 
methylester, uracil -5-oxyacetic acid (v) , 5-methyl-2- 
thiouracil, 3- (3-amino-3-N-2-carboxypropyl) uracil, 
(acp3)w / and 2 , 6 -diaminopurine . Alternatively, the 
antisense nucleic acid can be produced biologically using 
an expression vector into which a nucleic acid has been 
subcloned in an antisense orientation (i.e., RNA 
transcribed from the inserted nucleic acid will be of an 
antisense orientation to a target nucleic acid of 
interest, described further in the following subsection) . 

The antisense nucleic acid molecules of the 
invention are typically administered to a subject or 
generated in situ such that they hybridize with or bind 
to cellular mRNA and/or genomic DNA encoding a selected 
polypeptide of the invention to thereby inhibit 
expression, e.g., by inhibiting transcription and/or 
translation. The hybridization can be by conventional 
nucleotide complementarity to form a stable duplex, or, 
for example, in the case of an antisense nucleic acid 
molecule which binds to DNA duplexes, through specific 
interactions in the major groove of the double helix. An 
example of a route of administration of antisense nucleic 
acid molecules of the invention includes direct injection 
at a tissue site. Alternatively, antisense nucleic acid 
molecules can be modified to target selected cells and 
then administered systemically . For example, for 
systemic administration, antisense molecules can be 
modified such that they specifically bind to receptors or 
antigens expressed on a selected cell surface, e.g., by 
linking the antisense nucleic acid molecules to peptides 
or antibodies which bind to cell surface receptors or 
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strand, or to only a portion thereof, e.g., all or part 
of the protein coding region (or open reading frame) . An 
antisense nucleic acid molecule can be antisense to all 
or part of a noncoding region of the coding strand of a 
5 nucleotide sequence encoding a polypeptide of the 
invention. The noncoding regions ("5' and 3' 
untranslated regions' 1 ) are the 5' and 3' sequences which 
flank the coding region and are not translated into amino 
acids . 

10 An antisense oligonucleotide can be, for example, 

about 5, 10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides 
in length. An antisense nucleic acid of the invention 
can be constructed using chemical synthesis and enzymatic 
ligation reactions using procedures known in the art. 

15 For example, an antisense nucleic acid (e.g., an 

antisense oligonucleotide) can be chemically synthesized 
using naturally occurring nucleotides or variously 
modified nucleotides designed to increase the biological 
stability of the molecules or to increase the physical 

2 0 stability of the duplex formed between the antisense and 
sense nucleic acids, e.g., phosphorothioate derivatives 
and acridine substituted nucleotides can be used. 
Examples of modified nucleotides which can be used to 
generate the antisense nucleic acid include 5- 

25 f luorouracil , 5-bromouracil , 5-chlorouracil , 5- 

iodouracil, hypoxanthine , xanthine, 4-acetylcytosine, 5- 
(carboxyhydroxyl methyl) uracil, 5- 
carboxymethylaminomethyl-2-thiouridine, 5- 
carboxymethylaminomethyluracil , dihydrouracil , beta-D- 

30 galactosylqueosine, inosine, N6-isopentenyladenine, 1- 
methyl guanine, 1-methylinosine, 2 , 2 -dimethyl guanine, 2- 
methyladenine , 2 -me thylguanine , 3 -methylcytosine , 5 - 
methylcytosine , N6-adenine, 7 -methylguanine , 
methylaminomethyluracil , 5-methoxyaminomethyl-2- 

35 thiouracil, beta-D-mannosylqueos.ine, 5'- 
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Alternatively, an mRNA encoding a polypeptide of the 
invention can be used to select a catalytic RNA having a 
specific ribonuclease activity from a pool of RNA 
molecules. See, e.g., Bartel and Szostak (1993) Science 
5 261:1411-1418. 

The invention also encompasses nucleic acid 
molecules which form triple helical structures. For 
example, expression of a polypeptide of the invention can 
be inhibited by targeting nucleotide sequences 
0 complementary to the regulatory region of the gene 
encoding the polypeptide (e.g., the promoter and/or 
enhancer) to form triple helical structures that prevent 
transcription of the gene in target cells. See generally 
Helene (1991) Anticancer Drug Des. 6(6):569-84; Helene 
5 (1992) Ann. N.Y. Acad. Sci . 660:27-36; and Maher (1992) 
Bioassays 14 (12 ): 807-15 . 

In preferred embodiments, the nucleic acid molecules 
of the invention can be modified at the base moiety, 
sugar moiety or phosphate backbone to improve, e.g., the 
0 stability, hybridization, or solubility of the molecule. 
For example, the deoxyribose phosphate backbone of the 
nucleic acids can be modified to generate peptide nucleic 
acids (see Hyrup et al . (1996) Bioorganic & Medicinal 
Chemistry 4(1) : 5-23) . As used herein, the terms 
5 "peptide nucleic acids" or " PNAs " refer to nucleic acid 
mimics, e.g., DNA mimics, in which the deoxyribose 
phosphate backbone is replaced by a pseudopeptide 
backbone and only the four natural nucleobases are 
retained. The neutral backbone of PNAs has been shown to 
0 allow for specific hybridization to DNA and RNA under 
conditions of low ionic strength. The synthesis of PNA 
oligomers can be performed using standard solid phase 
peptide synthesis protocols as described in Hyrup et al . 
(1996), supra; Perry-O' Keef e et al . (1996) Proc . Natl. 
5 Acad. Sci. C7SA 93: 14670-675. 
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antigens. The antisense nucleic acid molecules can also 
be delivered to cells using the vectors described herein. 
To achieve sufficient intracellular concentrations of the 
antisense molecules ; vector constructs in which the 
5 antisense nucleic acid molecule is placed under the 
control of a strong pol II or pol III promoter are 
preferred. 

An antisense nucleic acid molecule of the invention 
can be an a-anomeric nucleic acid molecule. An a- 

10 anomeric nucleic acid molecule forms specific double- 
stranded hybrids with complementary RNA in which, 
contrary to the usual jS-units, the strands run parallel 
to each other (Gaultier et al . (1987) Nucleic Acids Res. 
15:6625-6641). The antisense nucleic acid molecule can 

15 also comprise a 2 ' -o-methylribonucleotide (Inoue et al . 
(1987) Nucleic Acids Res. 15:6131-6148) or a chimeric 
RNA-DNA analogue (Inoue et al . (1987) FEBS Lett. 215:327- 
330) . 

The invention also encompasses ribozymes . Ribozymes 

2 0 are catalytic RNA molecules with ribonuclease activity 

which are capable of cleaving a single -stranded nucleic 
acid, such as an mRNA, to which they have a complementary 
region. Thus, ribozymes (e.g., hammerhead ribozymes 
(described in Haselhoff and Gerlach (1988) Nature 
25 334:585-591)) can be used to catalytically cleave mRNA 

transcripts to thereby inhibit translation of the protein 
encoded by the mRNA. A ribozyme having specificity for a 
nucleic acid molecule encoding a polypeptide of the 
invention can be designed based upon the nucleotide 

3 0 sequence of a cDNA disclosed herein. For example, a 

derivative of a Tetrahymena L-19 IVS RNA can be 
constructed in which the nucleotide sequence of the 
active site is complementary to the nucleotide sequence 
to be cleaved in a Cech et al . U.S. Patent No. 4,987,071; 
35 and Cech et al . U.S. Patent No. 5,116,742. 
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PNA monomers are then coupled in a stepwise manner to 
produce a chimeric molecule with a 5' PNA segment and a 
3' DNA segment (Finn et al . (1996) Nucleic Acids Res. 
24 (17) :3357-63) . Alternatively, chimeric molecules can 
be synthesized with a 5' DNA segment and a 3' PNA segment 
(Peterser et al . (1975) Bioorganic Med. Chem. Lett. 

5:1119-11124) . 

In other embodiments, the oligonucleotide may 
include other appended groups such as peptides (e.g., for 
targeting host cell receptors in vivo) , or agents 
facilitating transport across the cell membrane (see, 
e.g., Letsinger et al . (1989) Proc. Natl. Acad. Sci. USA 
86:6553-6556; Lemaitre et al . (1987) Proc. Natl. Acad. 
Sci. USA 84:648-652; PCT Publication No. WO 88/09810) or 
the blood-brain barrier (see, e.g., PCT Publication No. 
W0 89/10134) . In addition, oligonucleotides can be 
modified with hybridization-triggered cleavage agents 
(see, e.g., Krol et al . (1988) Bio/Techniques 6:958-976) 
or intercalating agents (see, e.g., Zon (1988) Pharm. 
Res. 5:539-549) . To this end, the oligonucleotide may be 
conjugated to another molecule, e.g., a peptide, 
hybridization triggered cross-linking agent, transport 
agent, hybridization-triggered cleavage agent, etc. 

II. Isolated Proteins and Ant ibodies 

One aspect of the invention pertains to isolated 
proteins and polypeptides of the invention, and 
biologically active portions thereof, as well as 
polypeptide fragments suitable for use as immunogens to 
raise antibodies directed against a polypeptide of the 
invention. In one embodiment, the native polypeptide can 
be isolated from cells or tissue sources by an 
appropriate purification scheme using standard protein 
purification techniques. In another embodiment, 
polypeptides of the invention are produced by recombinant 
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PNAs can be used in therapeutic and diagnostic 
applications. For example, PNAs can be used as antisense 
or antigene agents for sequence-specific modulation of 
gene expression by, e.g., inducing transcription or 
5 translation arrest or inhibiting replication. PNAs can 
also be used, e.g., in the analysis of single base pair 
mutations in a gene by, e.g., PNA directed PGR clamping; 
as artificial restriction enzymes when used in 
combination with other enzymes, e.g., si nucleases (Hyrup 
10 (1996), supra; or as probes or primers for DNA sequence 
and hybridization (Hyrup (1996), supra; Perry-O' Keef e et 
al. (1996) Proc. Natl. Acad. Sci . USA 93: 14670-675). 

In another embodiment, PNAs can be modified, e.g., 
to enhance their stability or cellular uptake, by 
15 attaching lipophilic or other helper groups to PNA, by 
the formation of PNA -DNA chimeras, or by the use of 
liposomes or other techniques of drug delivery known in 
the art. For example, PNA -DNA chimeras can be generated 
which may combine the advantageous properties of PNA and 
20 DNA. Such chimeras allow DNA recognition enzymes, e.g., 
RNAse H and DNA polymerases, to interact with the DNA 
portion while the PNA portion would provide high binding 
affinity and specificity. PNA -DNA chimeras can be linked 
using linkers of appropriate lengths selected in terms of 
25 base stacking, number of bonds between the nucleobases, 
and orientation (Hyrup (1996), supra). The synthesis of 
PNA -DNA chimeras can be performed as described in Hyrup 
(1996), supra, and Finn et al . (1996) Nucleic Acids Res. 
24 (17) :3357-63 . For example, a DNA chain can be 
3 0 synthesized on a solid support using standard 
phosphoramidite coupling chemistry and modified 
nucleoside analogs. Compounds such as 5'-(4- 
methoxy^rityl)amino-5' -deoxy- thymidine phosphoramidite 
can be used as a link between the PNA and the 5' end of 
35 DNA (Mag et al . (1989) Nucleic Acids Res. 17:5973-88). 
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fewer amino acids than the full length protein, and 
exhibit at least one activity of the corresponding full- 
length protein. Typically, biologically active portions 
comprise a domain or motif with at least one activity of 
the corresponding protein. A biologically active portion 
of a protein of the invention can be a polypeptide which 
is, for example, 10, 25, 50, 100 or more amino acids in 
length. Moreover, other biologically active portions, in 
which other regions of the protein are deleted, can be 
prepared by recombinant techniques and evaluated for one 
or more of the functional activities of the native form 
of a polypeptide of the invention. 

Preferred polypeptides have the amino acid sequence 
of SEQ ID NO:2, 5, 7-11, and 12-16. Other useful 
proteins are substantially identical (e.g., at least 
about 45%, preferably 55%, 65%, 75%, 85%, 95%, or 99%) to 
SEQ ID NO: 2, 5, 7-11, and 12-16 and retain the functional 
activity of the protein of the corresponding naturally- 
occurring protein yet differ in amino acid sequence due 
to natural allelic variation or mutagenesis. 

To determine the percent identity of two amino acid 
sequences or of two nucleic acids, the sequences are 
aligned for optimal comparison purposes (e.g., gaps can 
be introduced in the sequence of a first amino acid or 
nucleic acid sequence for optimal alignment with a second 
amino or nucleic acid sequence) . The amino acid residues 
or nucleotides at corresponding amino acid positions or 
nucleotide positions are then compared. When a position 
in the first sequence is occupied by the same amino acid 
residue or nucleotide as the corresponding position in 
the second sequence, then the molecules are identical at 
that position. The percent identity between the two 
sequences is a function of the number of identical 
positions shared by the sequences (i.e., % identity = # 
of identical positions/total # of positions (e.g., 



WO 00/18800 



PCT/US99/22818 



- 35 - 

DNA techniques. Alternative to recombinant expression, a 
polypeptide of the invention can be synthesized 
chemically using standard peptide synthesis techniques. 
An "isolated" or "purified 11 protein or biologically 
5 active portion thereof is substantially free of cellular 
material or other contaminating proteins from the cell or 
tissue source from which the protein is derived, or 
substantially free of chemical precursors or other 
chemicals when chemically synthesized. The language 

10 "substantially free of cellular material" includes 

preparations of protein in which the protein is separated 
from cellular components of the cells from which it is 
isolated or recombinantly produced. Thus, protein that 
is substantially free of cellular material includes 

15 preparations of protein having less than about 30%, 20%, 
10%, or 5% (by dry weight) of heterologous protein (also 
referred to herein as a "contaminating protein") . When 
the protein or biologically active portion thereof is 
recombinantly produced, it is also preferably 

20 substantially free of culture medium, i.e., culture 

medium represents less than about 20%, 10%, or 5% of the 
volume of the protein preparation. When the protein is 
produced by chemical synthesis, it is preferably 
substantially free of chemical precursors or other 

25 chemicals, i.e., it is separated from chemical precursors 
or other chemicals which are involved in the synthesis of 
the protein. Accordingly such preparations of the 
protein have less than about 30%, 20%, 10%, 5% (by dry 
weight) of chemical precursors or compounds other than 

30 the polypeptide of interest. 

Biologically active portions of a polypeptide of the 
invention include polypeptides comprising amino acid 
sequences sufficiently identical to or derived from the 
amino acid sequence of the protein (e.g., the amino acid 

3 5 sequence of SEQ ID NO:2, 5, 8, or 13), which include 
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table, a gap length penalty of 12, and a gap penalty of 4 
can be used. 

The percent identity between two sequences can be 
determined using techniques similar to those described 
5 above, with or without allowing gaps. In calculating 
percent identity, only exact matches are counted. 

The invention also provides chimeric or fusion 
proteins. As used herein, a "chimeric protein" or 
"fusion protein" comprises all or part (preferably 
0 biologically active) of a polypeptide of the invention 
operably linked to a heterologous polypeptide (i.e., a 
polypepcide other than the same polypeptide of the 
invention) . Within the fusion protein, the term 
"operably linked" is intended to indicate that the 
5 polypeptide of the invention and the heterologous 
polypeptide are fused in-frame to each other. The 
heterologous polypeptide can be fused to the N- terminus 
or C- terminus of the polypeptide of the invention. 

One useful fusion protein is a GST fusion protein in 
0 which the polypeptide of the invention is fused to the C- 
terminus of GST sequences. Such fusion proteins can 
facilitate the purification of a recombinant polypeptide 
of the invention. 

In another embodiment, the fusion protein contains a 
5 heterologous signal sequence at its N-terminus. For 
example, the native signal sequence of a polypeptide of 
the invention can be removed and replaced with a signal 
sequence from another protein. For example, the gp67 
secretory sequence of the baculovirus envelope protein 
0 can be used as a heterologous signal sequence (Current 
Protocols in Molecular Biology, Ausubel et al . , eds . , 
John Wiley & Sons, 1992) . Other examples of eukaryotic 
heterologous signal sequences include the secretory 
sequences of melittin and human placental alkaline 
5 phosphatase (Stratagene; La Jolla, California) . In yet 
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overlapping positions) x 100) . Preferably, the two 
sequences are the same length. 

The determination of percent homology between two 
sequences can be accomplished using a mathematical 
5 algorithm. A preferred, non-limiting example of a 

mathematical algorithm utilized for the comparison of two 
sequences is the algorithm of Karlin and Altschul (1990) 
Proc. Natl. Acad. Sci. USA 87:2264-2268, modified as in 
Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 

10 90:5873-5877. Such an algorithm is incorporated into the 
NBLAST and XBLAST programs of Altschul, et al . (1990) J". 
Mol. Biol, 215:403-410. BLAST nucleotide searches can be 
performed with the NBLAST program, score = 100, 
wordlength = 12 to obtain nucleotide sequences homologous 

15 to a nucleic acid molecules of the invention. BLAST 
protein searches can be performed with the XBLAST 
program, score = 50, wordlength = 3 to obtain amino acid 
sequences homologous to a protein molecules of the 
invention. To obtain gapped alignments for comparison 

20 purposes, Gapped BLAST can be utilized as described in 
Altschul et al. (1997) Nucleic Acids Res. 25:3389-3402. 
Alternatively, PSI-Blast can be used to perform an 
iterated search which detects distant relationships 
between molecules. Id. When utilizing BLAST, Gapped 

25 BLAST, and PSI-Blast programs, the default parameters of 
the respective programs (e.g., XBLAST and NBLAST) can be 
used. See http://www.ncbi.nlm.nih.gov. Another 
preferred, non-limiting example of a mathematical 
algorithm utilized for the comparison of sequences is the 

3 0 algorithm of Myers and Miller, (1.988) CABIOS 4:11-11. 

Such an algorithm is incorporated into the ALIGN program 
(version 2.0) which is part of the GCG sequence alignment 
software package. When utilizing the ALIGN program for 
comparing amino acid sequences, a PAM12 0 weight residue 
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sequence (see, e.g., Ausubel et al . , supra). Moreover, 
many expression vectors are commercially available that 
already encode a fusion moiety (e.g., a GST polypeptide) . 
A nucleic acid encoding a polypeptide of the invention 
5 can be cloned into such an expression vector such that 
the fusion moiety is linked in- frame to the polypeptide 
of the invention. 

A signal sequence of a polypeptide of the invention 
(SEQ ID NO: 7 or 12) can be used to facilitate secretion 

10 and isolation of a secreted protein or other protein of 
interest. Signal sequences are typically characterized 
by a core of hydrophobic amino acids which are generally 
cleaved from the mature protein during secretion in one 
or more cleavage events. Such signal peptides contain 

15 processing sites that allow cleavage of the signal 

sequence from the mature proteins as they pass through 
the secretory pathway. Thus, the invention pertains to 
the described polypeptides having a signal sequence, as 
well as to the signal sequence itself and to the 

20 polypepcide in the absence of the signal sequence (i.e., 
the cleavage products) . In one embodiment, a nucleic 
acid sequence encoding a signal sequence of the invention 
can be operably linked in an expression vector to a 
protein of interest, such as a protein which is 

25 ordinarily not secreted or is otherwise difficult to 
isolate. The signal sequence directs secretion of the 
protein, such as from a eukaryotic host into which the 
expression vector is transformed, and the signal sequence 
is subsequently or concurrently cleaved. The protein can 

30 then be readily purified from the extracellular medium by 
art recognized methods. Alternatively, the signal 
sequence can be linked to the protein of interest using a 
sequence which facilitates purification, such as with a 
GST domain. 
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another example, useful prokaryotic heterologous signal 
sequences include the phoA secretory signal (Sambrook et 
al., supra) and the protein A secretory signal (Pharmacia 
Biotech; Piscataway, New Jersey) . 
5 In yet another embodiment, the fusion protein is an 

immunoglobulin fusion protein in which all or part of a 
polypeptide of the invention is fused to sequences 
derived from a member of the immunoglobulin protein 
family. The immunoglobulin fusion proteins of the 

10 invention can be incorporated into pharmaceutical 

compositions and administered to a subject to inhibit an 
interaction between a ligand (soluble or membrane -bound) 
and a protein on the surface of a cell (receptor) , to 
thereby suppress signal transduction in vivo. The 

15 immunoglobulin fusion protein can be used to affect the 
bioavailability of a cognate ligand of a polypeptide of 
the invention. Inhibition of ligand/receptor interaction 
may be useful therapeutically, both for treating 
proliferative and dif f erent iative disorders and for 

20 modulating (e.g. promoting or inhibiting) cell survival. 
Moreover, the immunoglobulin fusion proteins of the 
invention can be used as immunogens to produce antibodies 
directed against a polypeptide of the invention in a 
subject, to purify ligands and in screening assays to 

25 identify molecules which inhibit the interaction of 
receptors with ligands. 

Chimeric and fusion protein of the invention can be 
produced by standard recombinant DNA techniques. In 
another embodiment, the fusion gene can be synthesized by 

3 0 conventional techniques including automated DNA 

synthesizers. Alternatively, PCR amplification of gene 
fragments can be carried out using anchor primers which 
give rise to complementary overhangs between two 
consecutive gene fragments which can subsequently be 

3 5 annealed and reamplified to generate a chimeric gene 



WO 00/18800 



PCT/US99/22818 



- 42 - 

invention for agonist or antagonist activity. In one 
embodiment, a variegated library of variants is generated 
by combinatorial mutagenesis at the nucleic acid level 
and is encoded by a variegated gene library. A 
5 variegated library of variants can be produced by, for 
example, enzymatically ligating a mixture of synthetic 
oligonucleotides into gene sequences such that a 
degenerate set of potential protein sequences is 
expressible as individual polypeptides, or alternatively, 

10 as a set of larger fusion proteins (e.g., for phage 
display) . There are a variety of methods which can be 
used to produce libraries of potential variants of the 
polypeptides of the invention from a degenerate 
oligonucleotide sequence. Methods for synthesizing 

15 degenerate oligonucleotides are known in the art (see, 
e.g., Narang (1983) Tetrahedron 39:3; Itakura et al . 
(1984) Annu. Rev. Biochem. 53:323; Itakura et al . (1984) 
Science 198:1056; Ike et al . (1983) Nucleic Acid Res. 
11:477) . 

20 In addition, libraries of fragments of the coding 

sequence of a polypeptide of the invention can be used to 

generate a variegated population of polypeptides for 

screening and subsequent selection of variants. For 

example, a library of coding sequence fragments can be 

o 

25 generated by treating a double stranded PCR fragment of 
the coding sequence of interest with a nuclease under 
conditions wherein nicking occurs only about once per 
molecule, denaturing the double stranded DNA, renaturing 
the DNA to form double stranded DNA which can include 

30 sense/antisense pairs from different nicked products, 

removing single stranded portions from reformed duplexes 
by treatment with SI nuclease, and ligating the resulting 
fragment library into an expression vector. By this 
method, an expression library can be derived which 
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In another embodiment, the signal sequences of the 
present invention can be used to identify regulatory 
sequences, e.g., promoters, enhancers, repressors. Since 
signal sequences are the most amino-terminal sequences of 
5 a peptide, it is expected that the nucleic acids which 
flank the signal sequence on its amino-terminal side will 
be regulatory sequences which affect transcription. 
Thus, a nucleotide sequence which encodes all or a 
portion of a signal sequence can be used as a probe to 

10 identify and isolate signal sequences and their flanking 
regions, and these flanking regions can be studied to 
identify regulatory elements therein. 

The present invention also pertains to variants of the 
polypeptides of the invention. Such variants have an 

15 altered amino acid sequence which can function as either 
agonists (mimetics) or as antagonists. Variants can be 
generated by mutagenesis, e.g., discrete point mutation 
or truncation. An agonist can retain substantially the 
same, or a subset, of the biological activities of the 

20 naturally occurring form of the protein. An antagonist 
of a protein can inhibit one or more of the activities of 
the naturally occurring form of the protein by, for 
example, competitively binding to a downstream or 
upstream member of a cellular signaling cascade which 

25 includes the protein of interest. Thus, specific 

biological effects can be elicited by treatment with a 
variant of limited function. Treatyent of a subject with 
a variant having a subset of the biological activities of 
the naturally occurring form of the protein can have 

30 fewer side effects in a subject relative to treatment 
with the naturally occurring form of the protein. 

Variants of a protein of the invention which function 
as either agonists (mimetics) or as antagonists can be 
identified by screening combinatorial libraries of 

35 mutants, e.g., truncation mutants, of the protein of the 
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protein, e.g., hydrophilic regions. Figures 2 and 4 are 
hydrophobic ity plots of the proteins of the invention. 
These plots or similar analyses can be used to identify 
hydrophilic regions. 
5 An immunogen typically is used to prepare antibodies by 
immunizing a suitable subject, (e.g., rabbit, goat, mouse 
or other mammal) . An appropriate immunogenic preparation 
can contain, for example, recombinantly expressed 
chemically synthesized polypeptide. The preparation can 
0 further include an adjuvant, such as Freund's complete or 
incomplete adjuvant, or similar immunostimulatory agent. 

Accordingly, another aspect of the invention pertains 
to antibodies directed against a polypeptide of the 
invention. The term "antibody" as used herein refers to 
5 immunoglobulin molecules and immunologically active 
portions of immunoglobulin molecules, i.e., molecules 
that contain an antigen binding site which specifically 
binds an antigen, such as a polypeptide of the invention. 
A molecule which specifically binds to a given 
0 polypeptide of the invention is a molecule which binds 
the polypeptide, but does not substantially bind other 
molecules in a sample, e.g., a biological sample, which 
naturally contains the polypeptide. Examples of 
immunologically active portions of immunoglobulin 
5 molecules include F(ab) and F(ab') 2 fragments which can be 
generated by treating the antibody with an enzyme such as 
pepsin. The invention provides polyclonal and monoclonal 
antibodies. The term "monoclonal antibody" or 
"monoclonal antibody composition", as used herein, refers 
0 to a population of antibody molecules that contain only 
one species of an antigen binding site capable of 
immunoreacting with a particular epitope. 

Polyclonal antibodies can be prepared as described 
above by immunizing a suitable subject with a polypeptide 
5 of the invention as an immunogen. The antibody titer in 
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encodes N- terminal and internal fragments of various 
sizes of the protein of interest. 

Several techniques are known in the art for screening 
gene products of combinatorial libraries made by point 
5 mutations or truncation, and for screening cDNA libraries 
for gene products having a selected property. The most 
widely used techniques, which are amenable to high 
through-put analysis, for screening large gene libraries 
typically include cloning the gene library into 

10 replicable expression vectors, transforming appropriate 
cells with the resulting library of vectors, and 
expressing the combinatorial genes under conditions in 
which detection of a desired activity facilitates 
isolation of the vector encoding the gene whose product 

15 was detected. Recursive ensemble mutagenesis (REM), a 
technique which enhances the frequency of functional 
mutants in the libraries, can be used in combination with 
the screening assays to identify variants of a protein of 
the invention (Arkin and Yourvan (1992) Proc. Natl. Acad. 

20 Sci. USA 89:7811-7815; Delgrave et al . (1993) Protein 
Engineering- 6 (3) : 327-331) . 

An isolated polypeptide of the invention, or a fragment 
thereof, can be used as an immunogen to generate 
antibodies using standard techniques for polyclonal and 

25 monoclonal antibody preparation. The full-length 

polypeptide or protein can be used or, alternatively, the 
invention provides antigenic peptide fragments for use as 
immunogens. The antigenic peptide of a protein of the 
invention comprises at least 8 (preferably 10, 15, 20, or 

3 0 3 0) amino acid residues of the amino acid sequence of SEQ 
ID NO: 8 or 13 and encompasses an epitope of the protein 
such that an antibody raised against the peptide forms a 
specific immune complex with the protein. 

Preferred epitopes encompassed by the antigenic peptide 

35 are regions that are located on the surface of the 
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particularly amenable for use in generating and screening 
antibody display library can be found in, for example, 
U.S. Patent No. 5,223,409; PCT Publication No. WO 
92/18619; PCT Publication No. WO 91/17271; PCT 

5 Publication No. WO 92/20791; PCT Publication No. WO 
92/15679; PCT Publication No. WO 93/01288; PCT 
Publication No. WO 92/01047; PCT Publication No. WO 
92/09690; PCT Publication No. WO 90/02809; Fuchs et al . 
(1991) Bio/Technology 9:1370-1372; Hay et al . (1992) Hum. 

0 Antibod. Hybridomas 3:81-85; Huse et al . (1989) Science 
246:1275-1281; Griffiths et al . (1993) EMBO J. 12:725- 
734. 

Additionally, recombinant antibodies, such as chimeric 
and humanized monoclonal antibodies, comprising both 

5 human and non- human portions, which can be made using 

standard recombinant DNA techniques, are within the scope 
of the invention. Such chimeric and humanized monoclonal 
antibodies can be produced by recombinant DNA techniques 
known in the art, for example using methods described in 

0 PCT Publication No. WO 87/02671; European Patent 

Application 184,187; European Patent Application 171,496; 
European Patent Application 173,494; PCT Publication No. 
WO 86/01533; U.S. Patent No. 4,816,567; European Patent 
Application 125,023; Better et al . (1988) Science 

5 240:1041-1043; Liu et al . (1987) Proc . Natl. Acad. Sci. 
USA 84:3439-3443; Liu et al . (1987) J. Immunol. 
139:3521-3526; Sun et al . (1987) Proc. Natl. Acad. Sci. 
USA 84:214-218; Nishimura et al . (1987) Cane. Res. 
47:999-1005; Wood et al . (1985) Nature 314:446-449; and 

0 Shaw et al . (1988) J. Natl. Cancer Inst. 80:1553-1559); 
Morrison (1985) Science 229:1202-1207; Oi et al . (1986) 
Bio/Techniques 4:214; U.S. Patent 5,225,539; Jones et al . 
(1986) Nature 321:552-525; Verhoeyan et al . (1988) 
Science 239:1534; and Beidler et al . (1988) J. Immunol. 

5 141:4053-4060. 
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the immunized subject can be monitored over time by- 
standard techniques, such as with an enzyme linked 
immunosorbent assay (ELISA) using immobilized 
polypeptide. If desired, the antibody molecules can be 
5 isolated from the mammal (e.g., from the blood) and 
further purified by well-known techniques, such as 
protein A chromatography to obtain the IgG fraction. At 
an appropriate time after immunization, e.g., when the 
specific antibody titers are highest, antibody-producing 

10 cells can be obtained from the subject and used to 
prepare monoclonal antibodies by standard techniques, 
such as the hybridoma technique originally described by 
Kohler and Milstein (1975) Nature 256:495-497, the human 
B cell hybridoma technique (Kozbor et al . (1983) Immunol. 

15 Today 4:12), the EBV-hybridoma technique (Cole et al . 

(1985), Monoclonal Antibodies and Cancer Therapy, Alan R. 
Liss, Inc., pp. 77-96) or trioma techniques. The 
technology for producing hybridomas is well known (see 
generally Current Protocols in Immunology (1994) Coligan 

20 et al. (eds.) John Wiley & Sons, Inc., New York, NY). 
Hybridoma cells producing a monoclonal antibody of the 
invention are detected by screening the hybridoma culture 
supernatants for antibodies that bind the polypeptide of 
interest, e.g., using a standard ELISA assay. 

25 Alternative to preparing monoclonal antibody-secreting 
hybridomas, a monoclonal antibody directed against a 
polypeptide of the invention can be identified and 
isolated by screening a recombinant combinatorial 
immunoglobulin library (e.g., an antibody phage display 

30 library) with the polypeptide of interest. Kits for 
generating and screening phage display libraries are 
commercially available (e.g., the Pharmacia Recombinant 
Phage Antibody System, Catalog No. 27-9400-01; and the 
Stratagene Surf ZAP 1 * Phage Display Kit, Catalog No. 

35 240612) . Additionally, examples of methods and reagents 
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An antibody directed against a polypeptide of the 
invention (e.g., monoclonal antibody) can be used to 
isolate the polypeptide by standard techniques, such as 
affinity chromatography or immunoprecipitation. 
5 Moreover, such an antibody can be used to detect the 
protein (e.g., in a cellular lysate or cell supernatant) 
in order to evaluate the abundance and pattern of 
expression of the polypeptide. The antibodies can also 
be used diagnost ically to monitor protein levels in 
10 tissue as part of a clinical testing procedure, e.g., to, 
for example, determine the efficacy of a given treatment 
regimen. Detection can be facilitated by coupling the 
antibody to a detectable substance. Examples of 
detectable substances include various enzymes, prosthetic 
15 groups, fluorescent materials, luminescent materials, 
bioluminescent materials, and radioactive materials. 
Examples of suitable enzymes include horseradish 
peroxidase, alkaline phosphatase, 0-galactosidase, or 
acetylcholinesterase; examples of suitable prosthetic 
2 0 group complexes include streptavidin/biotin and 

avidin/biotin; examples of suitable fluorescent materials 
include umbel lif erone, fluorescein, fluorescein 
isothiocyanate , rhodamine , dichlorotriazinylamine 
fluorescein, dansyl chloride or phycoerythrin; an example 
25 of a luminescent material includes luminol; examples of 
bioluminescent materials include lucif erase, luciferin, 
and aequorin, and examples of suitable radioactive 
material include 125 I , 131 I, 35 S or 3 H. 

III. Recombinant Expression Vectors and Host Cells 
30 Another aspect of the invention pertains to vectors, 
preferably expression vectors, containing a nucleic acid 
encoding a polypeptide of the invention (or a portion 
thereof) . As used herein, the term "vector" refers to a 
nucleic acid molecule capable of transporting another 
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Completely human antibodies are particularly desirable 
for therapeutic treatment of human patients. Such 
antibodies can be produced using transgenic mice which 
are incapable of expressing endogenous immunoglobulin 
5 heavy and light chains genes, but which can express human 
heavy and light chain genes. The transgenic mice are 
immunized in the normal fashion with a selected antigen, 
e.g., all or a portion of a polypeptide of the invention. 
Monoclonal antibodies directed against the antigen can be 

10 obtained using conventional hybridoma technology. The 
human immunoglobulin transgenes harbored by the 
transgenic mice rearrange during B cell differentiation, 
and subsequently undergo class switching and somatic 
mutation. Thus, using such a technique, it is possible 

15 to produce therapeutically useful IgG, IgA and IgE 
antibodies. For an overview of this technology for 
producing human antibodies, see Lonberg and Huszar (1995, 
Int. Rev, Immunol. 13:65-93). For a detailed discussion 
of this ^technology for producing human antibodies and 

20 human monoclonal antibodies and protocols for producing 
such antibodies, see, e.g., U.S. Patent 5,625,126; U.S. 
Patent 5,633,425; U.S. Patent 5,569,825; U.S. Patent 
5,661,016; and U.S. Patent 5,545,806. In addition, 
companies such as Abgenix, Inc. (Freemont, CA) , can be 

2 5 engaged to provide human antibodieis directed against a 
selected antigen using technology similar to that 
described above . 

Completely human antibodies which recognize a selected 
epitope can be generated using a technique referred to as 

30 "guided selection." In this approach a selected 

non-human monoclonal antibody, e.g., a murine antibody, 
is used to guide the selection of a completely human 
antibody recognizing the same epitope (Jespers et al. 
(1994) Bio/Technology 12:899-903). 
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the vector is introduced into the host cell) . The term 
"regulatory sequence" is intended to include promoters, 
enhancers and other expression control elements (e.g., 
polyadenylation signals) . Such regulatory sequences are 
5 described, for example, in Goeddel, Gene Expression 
Technology: Methods in Enzymology 185, Academic Press, 
San Diego, CA (1990) . Regulatory sequences include those 
which direct constitutive expression of a nucleotide 
sequence in many types of host cell and those which 
10 direct expression of the nucleotide sequence only in 
certain host cells (e.g., tissue-specific regulatory 
sequences) . It will be appreciated by those skilled in 
the art that the design of the expression vector can 
depend on such factors as the choice of the host cell to 
15 be transformed, the level of expression of protein 

desired, etc. The expression vectors of the invention 
can be introduced into host cells to thereby produce 
proteins or peptides, including fusion proteins or 
peptides, encoded by nucleic acids as described herein. 
20 The recombinant expression vectors of the invention can 
be designed for expression of a polypeptide of the 
invention in prokaryotic or eukaryotic cells, e.g., 
bacterial cells such as E. coli, insect cells (using 
baculovirus expression vectors) , yeast cells or mammalian 
25 cells. Suitable host cells are discussed further in 
Goeddel, supra. Alternatively, the recombinant 
expression vector can be transcribed and translated in 
vitro, lor example using T7 promoter regulatory sequences 
and T7 polymerase. 
30 Expression of proteins in prokaryotes is most often 
carried out in E. coli with vectors containing 
constitutive or inducible promoters directing the 
expression of either fusion or non-fusion proteins. 
Fusion vectors add a number of amino acids to a protein 
3 5 encoded therein, usually to the amino terminus of the 
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nucleic acid to which it has been linked. One type of 
vector is a "plasmid" , which refers to a circular double 
stranded DNA loop into which additional DNA segments can 
be ligated. Another type of vector is a viral vector, 
5 wherein additional DNA segments can be ligated into the 
viral genome. Certain vectors are capable of autonomous 
replication in a host cell into which they are introduced 
(e.g., bacterial vectors having a bacterial origin of 
replication and episomal mammalian vectors) . Other 

10 vectors (e.g., non-episomal mammalian vectors) are 
integrated into the genome of a host cell upon 
introduction into the host cell, and thereby are 
replicated along with the host genome. Moreover, certain 
vectors, expression vectors, are capable of directing the 

15 expression of genes to which they are operably linked. 
In general, expression vectors of utility in recombinant 
DNA techniques are often in the form of plasmids 
(vectors) . However, the invention is intended to include 
such other forms of expression vectors, such as viral 

20 vectors (e.g., replication defective retroviruses, 

adenoviruses and adeno-associated viruses) , which serve 
equivalent functions . 

The recombinant expression vectors of the invention 
comprise a nucleic acid of the invention in a form 

25 suitable for expression of the nucleic acid in a host 

cell . This means that the recombinant expression vectors 
include one or more regulatory sequences, selected on the 
basis of the host cells to be used for expression, which 
is operably linked to the nucleic acid sequence to be 

3 0 expressed. Within a recombinant expression vector, 

"operably linked" is intended to mean that the nucleotide 
sequence of interest is linked to the regulatory 
sequence (s) in a manner which allows for expression of 
the nucleotide sequence (e.g., in an in vitro 

35 transcription/translation system or in a host cell when 
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recombinant protein (Gottesman, Gene Expression 
Technology ; Methods in Enzymology 18 5, Academic Press, 
San Diego, California (1990) 119-128) . Another strategy 
is to alter the nucleic acid sequence of the nucleic acid 
to be inserted into an expression vector so that the 
individual codons for each amino acid are those 
preferentially utilized in E. coli (Wada et al . (1992) 
Nucleic Acids Res. 20:2111-2118). Such alteration of 
nucleic acid sequences of the invention can be carried 
out by standard DNA synthesis techniques. 

In another embodiment, the expression vector is a yeast 
expression vector. Examples of vectors for expression in 
yeast S. cerivisae include pYepSecl (Baldari et al . 
(1987) EMBO J. 6:229-234), pMFa (Kurjan and Herskowitz, 
(1982) Cell 30:933-943), pJRY88 (Schultz et al . (1987) 
Gene 54:113-123), pYES2 (Invitrogen Corporation, San 
Diego, CA) , and pPicZ (Invitrogen Corp, San Diego, CA) . 

Alternatively, the expression vector is a baculovirus 
expression vector. Baculovirus vectors available for 
expression of proteins in cultured insect cells (e.g., Sf 
9 cells) include the pAc series (Smith et al . (1983) Mol. 
Cell Biol. 3:2156-2165) and the pVL series (Lucklow and 
Summers (1989) Virology 170:31-39). 

In yet another embodiment, a nucleic acid of the 
invention is expressed in mammalian cells using a 
mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed (1987) Nature 
329:840) and pMT2PC (Kaufman et al . (1987) EMBO J. 6:187- 
195) . When used in mammalian cells, the expression 
vector's control functions are often provided by viral 
regulatory elements. For example, commonly used 
promoters are derived from polyoma, Adenovirus 2, 
cytomegalovirus and Simian Virus 40. For other suitable 
expression systems for both prokaryotic and eukaryotic 
cells see chapters 16 and 17 of Sambrook et al . , supra. 
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recombinant protein. Such fusion vectors typically serve 
three purposes: l) to increase expression of recombinant 
protein; 2) to increase the solubility of the recombinant 
protein; and 3) to aid in the purification of the 
5 recombinant protein by acting as a ligand in affinity 
purification. Often, in fusion expression vectors, a 
proteolytic cleavage site is introduced at the junction 
of the fusion moiety and the recombinant protein to 
enable separation of the recombinant protein from the 

10 fusion moiety subsequent to purification of the fusion 
protein. Such enzymes, and their cognate recognition 
sequences, include Factor Xa, thrombin and enterokinase . 
Typical fusion expression vectors include pGEX (Pharmacia 
Biotech Inc; Smith and Johnson (1988) Gene 67:31-40), 

15 pMAL (New England Biolabs, Beverly, MA) and pRIT5 

(Pharmacia, Piscataway, NJ) which fuse glutathione S- 
transferase (GST) , maltose E binding protein, or protein 
A, respectively, to the target recombinant protein. 
Examples of suitable inducible non-fusion E. coli 

20 expression vectors include pTrc (Amann et al., (1988) 
Gene 69:301-315) and pET lid (Studier et al . , Gene 
Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, California (1990) 60-89) . 
Target gene expression from the pTrc vector relies on 

25 host RNA polymerase transcription from a hybrid trp-lac 
fusion promoter. Target gene expression from the pET lid 
vector relies on transcription from a T7 gnlO-lac fusion 
promoter mediated by a coexpressed viral RNA polymerase 
(T7 gnl) . This viral polymerase is supplied by host 

30 s-rains BL2KDE3) or HMS174 (DE3) from a resident X 
prophage harboring a T7 gnl gene under the 
transcriptional control of the lacUV 5 promoter. 

One strategy tc maximize recombinant protein expression 
in E . coli is to express the protein in a host bacteria 

35 with an impaired capacity to proteolyt ically cleave the 
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cell types, for instance viral promoters and/or 
enhancers, or regulatory sequences can be chosen which 
direct constitutive, tissue specific or cell type 
specific expression of antisense RNA. The antisense 
5 expression vector can be in the form of a recombinant 
plasmid, phagemid or attenuated virus in which antisense 
nucleic acids are produced under the control of a high 
efficiency regulatory region, the activity of which can 
be determined by the cell type into which the vector is 
10 introduced. For a discussion of the regulation of gene 
expression using antisense genes see Weintraub et al . 
{Reviews - Trends in Genetics, Vol. 1(1) 1986). 

Another aspect of the invention pertains to host cells 
into which a recombinant expression vector of the 
15 invention has been introduced. The terms "host cell" and 
"recombinant host cell" are used interchangeably herein. 
It is understood that such terms refer not only to the 
particular subject cell but to the progeny or potential 
progeny of such a cell. Because certain modifications 
20 may occur in succeeding generations due to either 

mutation or environmental influences, such progeny may 
not, in fact, be identical to the parent cell, but are 
still included within the scope of the term as used 
herein . 

25 A host cell can be any prokaryotic (e.g., E. coli) or 
eukaryotic cell (e.g., an insect cell, yeast, or a 
mammalian cell) . 

Vector DNA can be introduced into prokaryotic or 
eukaryotic cells via conventional transformation or 
30 transfection techniques. As used herein, the terms 

" transformation" and "transfection" are intended to refer 
to a variety of arc -recognized techniques for introducing 
foreign nucleic acid into a host cell, including calcium 
phosphace or calcium chloride co-precipitation, DEAE- 
35 dextran-mediated cransf ection, lipofection, or 
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In another embodiment, the recombinant mammalian 
expression vector is capable of directing expression of 
the nucleic acid preferentially in a particular cell type 
(e.g., tissue-specific regulatory elements are used to 
5 express the nucleic acid) . Tissue-specific regulatory 
elements are known in the art. Non-limiting examples of 
suitable tissue-specific promoters include the albumin 
promoter (liver-specific ; Pinkert et al . (1987) Genes 
Dev. 1:268-277), lymphoid- specif ic promoters (Calameand 

10 Eaton (1988) Adv. Immunol. 43:235-275), in particular 
promoters of T cell receptors (Winoto and Baltimore 
(1989) EMBO J. 8:729-733) and immunoglobulins (Banerji et 
al. (1983) Cell 33:729-740; Queen and Baltimore (1983) 
Cell 33:741-748), neuron-specific promoters (e.g., the 

15 neurofilament promoter; Byrne and Ruddle (1989) Proc. 
Natl. Acad. Sci . USA 86:5473-5477), pancreas-specific 
promoters (Edlund et al . (1985) Science 230:912-916), and 
mammary gland-specific promoters (e.g., milk whey 
promoter; U.S. Patent No. 4,873,316 and European 

20 Application Publication No. 264,166). Developmentally- 
regulated promoters are also encompassed, for example the 
murine hox promoters (Kessel and Gruss (1990) Science 
249:374-379) and the a- fetoprotein promoter (Campes and 
Tilghman (1989) Genes Dev. 3:537-546). 

2 5 The invention further provides a recombinant expression 

vector comprising a DNA molecule of the invention cloned 
into the expression vector in an antisense orientation. 
That is, the DNA molecule is operably linked to a 
regulatory sequence in a manner which allows for 
30 expression (by transcription of the DNA molecule) of an 
RNA molecule which is antisense to the mRNA encoding a 
polypeptide of the invention. Regulatory sequences 
operably linked :o a nucleic acid cloned in the antisense 
orientation can be chosen which direct the continuous 

3 5 expression of the antisense RNA molecule in a variety of 
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non-human transgenic animals in which exogenous sequences 
encoding a polypeptide of the invention have been 
introduced into their genome or homologous recombinant 
animals in which endogenous encoding a polypeptide of the 
5 invention sequences have been altered. Such animals are 
useful for studying the function and/or activity of the 
polypeptide and for identifying and/or evaluating 
modulators of polypeptide activity. As used herein, a 
"transgenic animal" is a -non -human animal, preferably a 
10 mammal, more preferably a rodent such as a rat or mouse, 
in which one or more of the cells of the animal includes 
a transgene. Other examples of transgenic animals 
include non-human primates, sheep, dogs, cows, goats, 
chickens, amphibians, etc. A transgene is exogenous DNA 
15 which is integrated into the genome of a cell from which 
a transgenic animal develops and which remains in the 
genome of the mature animal, thereby directing the 
expression of an encoded gene product in one or more cell 
types or tissues of the transgenic animal. As used 
20 herein, an "homologous recombinant animal" is a non-human 
animal, preferably a mammal, more preferably a mouse, in 
which an endogenous gene has been altered by homologous 
recombination between the endogenous gene and an 
exogenous DNA molecule introduced into a cell of the 
25 animal, e.g., an embryonic cell of the animal, prior to 
development of the animal. 

A transgenic animal of the invention can be created by 
introducing nucleic acid encoding a polypeptide of the 
invention (or a homologue thereof) into the male 
30 pronuclei of a fertilized oocyte, e.g., by 

microinjection, retroviral infection, and allowing the 
oocyte co develop in a pseudopregnant female foster 
animal. Intromc sequences and polyadenylation signals 
can also be included in the transgene to increase the 
35 efficiency of expression of the transgene. A tissue- 
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electroporation. Suitable methods for transforming or 
transfecting host cells can be found in Sambrook, et al . 
{supra) , and other laboratory manuals. 

For stable transfection of mammalian cells, it is known 
that, depending upon the expression vector and 
transfection technique used, only a small fraction of 
cells may integrate the foreign DNA into their genome. 
In order to identify and select these integrants, a gene 
that encodes a selectable marker (e.g., for resistance to 
antibiotics) is generally introduced into the host cells 
along with the gene of interest. Preferred selectable 
markers include those which confer resistance to drugs, 
such as G418, hygromycin and methotrexate. Cells stably 
transfected with the introduced nucleic acid can be 
identified by drug selection (e.g., cells that have 
incorporated the selectable marker gene will survive, 
while the other cells die) . 

A host cell of the invention, such as a prokaryotic or 
eukaryotic host cell in culture, can be used to produce a 
polypeptide of the invention. Accordingly, the invention 
further provides methods for producing a polypeptide of 
the invention using the host cells of the invention. In 
one embodiment, the method comprises culturing the host 
cell of invention (into which a recombinant expression 
vector encoding a polypeptide of the invention has been 
introduced) in a suitable medium such that the 
polypeptide is produced. In another embodiment, the 
method further comprises isolating the polypeptide from 
the medium or the host cell. 

The host cells of the invention can also be used to 
produce nonhuman transgenic animals. For example, in one 
embodiment, a host cell of the invention is a fertilized 
oocyte or an embryonic stem cell into which a sequences 
encoding a polypeptide of the invention have been 
introduced. Such host cells can then be used to create 
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5' and 3' ends by additional nucleic acid of the gene to 
allow for homologous recombination to occur between the 
exogenous gene carried by the vector and an endogenous 
gene in an embryonic stem cell. The additional flanking 
nucleic acid sequences are of sufficient length for 
successful homologous recombination with the endogenous 
gene. Typically, several kilobases of flanking DNA (both 
at the 5' and 3' ends) are included in the vector (see, 
e.g., Thomas and Capecchi (1987) Cell 51:503 for a 
) description of homologous recombination vectors) . The 
vector is introduced into an embryonic stem cell line 
(e.g., by electroporation) and cells in which the 
introduced gene has homologously recombined with the 
endogenous gene are selected (see, e.g., Li et al . (1992) 
5 Cell 69:915). The selected cells are then injected into 
a blastocyst of an animal (e.g., a mouse) to form 
aggregation chimeras (see, e.g., Bradley in 
Teratocarcinomas and Embryonic Stem Cells: A Practical 
Approach, Robertson, ed. (IRL, Oxford, 1987) pp. 113- 
0 152) . A chimeric embryo can then be implanted into a 
suitable pseudopregnant female foster animal and the 
embryo brought to term. Progeny harboring the 
homologously recombined DNA in their germ cells can be 
used to breed animals in which all cells of the animal 
5 contain the homologously recombined DNA by germline 

transmission of the transgene. Methods for constructing 
homologous recombination vectors and homologous 
recombinant animals are described further in Bradley 
(1991) Current Opinion in Bio /Technology 2:823-829 and in 
0 PCT Publication NOS . WO 90/11354, WO 91/01140, WO 
92/0968, and WO 93/04169. 

In another embodiment, transgenic non-human animals can 
be produced which contain selected systems which allow 
for regulated expression of the transgene. One example 
S5 of such a system is the cre/loxP recombinase system of 
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specific regulatory sequence (s) can be operably linked to 
the transgene to direct expression of the polypeptide of 
the invention to particular cells. Methods for 
generating transgenic animals via embryo manipulation and 
5 microinjection, particularly animals such as mice, have 
become conventional in the art and are described, for 
example, in U.S. Patent NOS . 4,736,866 and 4,870,009, 
U.S. Patent No. 4,873,191 and in Hogan, Manipulating the 
Mouse Embryo, (Cold Spring Harbor Laboratory Press, Cold 

10 Spring Harbor, N.Y., 1986). Similar methods are used for 
production of other transgenic animals. A transgenic 
founder animal can be identified based upon the presence 
of the transgene in its genome and/or expression of mRNA 
encoding the transgene in tissues or cells of the 

15 animals. A transgenic founder animal can then be used to 
breed additional animals carrying the transgene. 
Moreover, transgenic animals carrying the transgene can 
further be bred to other transgenic animals carrying 
other transgenes. 

20 To create an homologous recombinant animal, a vector is 
prepared which contains at least a portion of a gene 
encoding a polypeptide of the invention into which a 
deletion, addition or substitution has been introduced to 
thereby alter, e.g., functionally disrupt, the gene. In 

2 5 a preferred embodiment, the vector is designed such that, 

upon homologous recombination, the endogenous gene is 
functionally disrupted (i.e., no longer encodes a 
functional protein; also referred to as a "knock out" 
vector) . Alternatively, the vector can be designed such 

3 0 that, upon homologous recombination, the endogenous gene 

is mutated or otherwise altered but still encodes 
functional protein (e.g., the upstream regulatory region 
can be altered to thereby alter the expression of the 
endogenous protein) . In the homologous recombination 
35 vector,, the altered portion of the gene is flanked at its 
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Supplementary active compounds can also be incorporated 
into the compositions. 

A pharmaceutical composition of the invention is 
formulated to be compatible with its intended route of 
administration. Examples of routes of administration 
include parenteral, e.g., intravenous, intradermal, 
subcutaneous, oral (e.g., inhalation), transdermal 
(topical), transmucosal , and rectal administration. 
Solutions or suspensions used for parenteral, 
intradermal, or subcutaneous application can include the 
following components: a sterile diluent such as water for 
injection, saline solution, fixed oils, polyethylene 
glycols, glycerine, propylene glycol or other synthetic 
solvents; antibacterial agents such as benzyl alcohol or 
methyl parabens; antioxidants such as ascorbic acid or 
sodium bisulfite; chelating agents such as 
ethylenediaminetetraacetic acid; buffers such as 
acetates, citrates or phosphates and agents for the 
adjustment of tonicity such as sodium chloride or 
dextrose. pH can be adjusted with acids or bases, such 
as hydrochloric acid or sodium hydroxide. The parenteral 
preparation can be enclosed in ampoules, disposable 
syringes or multiple dose vials made of glass or plastic. 

Pharmaceutical compositions suitable for injectable use 
include sterile aqueous solutions (where water soluble) 
or dispersions and sterile powders for the extemporaneous 
preparation of sterile injectable solutions or 
dispersions. For intravenous administration, suitable 
carriers include physiological saline, bacteriostatic 
water, Cremophor EL™ (BASF; Parsippany, NJ) or phosphate 
buffered saline (PBS) . In all cases, the composition 
must be sterile and should be fluid to the extent that 
easy syringability exists. It must be stable under the 
conditions of manufacture and storage and must be 
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bacteriophage PI. For a description of the cre/loxP 
recombinase system, see, e.g., Lakso et al . (1992) Proc . 
Natl. Acad. Sc±. USA 89:6232-6236. Another example of a 
recombinase system is the FLP recombinase system of 
5 Saccharomyces cerevisiae (0' Gorman et al . (1991) Science 
251:1351-1355. If a cre/loxP recombinase system is used 
to regulate expression of the transgene, animals 
containing transgenes encoding both the Cre recombinase 
and a selected protein are required. Such animals can be 

10 provided through the construction of "double" transgenic 
animals, e.g., by mating two transgenic animals, one 
containing a transgene encoding a selected protein and 
the other containing a transgene encoding a recombinase. 
Clones of the non-human transgenic animals described 

15 herein can also be produced according to the methods 

described in Wilmut et al . (1997) Nature 385:810-813 and 
PCT Publication NOS . WO 97/07668 and WO 97/07669. 

IV. Pharmaceutical Compositions 

The nucleic acid molecules, polypeptides, and 

20 antibodies (also referred to herein as "active 

compounds") of the invention can be incorporated into 
pharmaceutical compositions suitable for administration. 
Such compositions typically comprise the nucleic acid 
molecule, protein, or antibody and a pharmaceutical^ 

2 5 acceptable carrier. As used herein the language 

"pharmaceutical ly acceptable carrier" is intended to 
include any and all solvents, dispersion media, coatings, 
antibacterial and antifungal agents, isotonic and 
absorption delaying agents, and the like, compatible with 

30 pharmaceutical administration. The use of such media and 
agents for pharmaceutical^ active substances is well 
known in the art. Except insofar as any conventional 
media or agent is incompatible with the active compound, 
use thereof in the compositions is contemplated. 
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Oral compositions generally include an inert diluent or 
an edible carrier. They can be enclosed in gelatin 
capsules or compressed into tablets. For the purpose of 
oral therapeutic administration, the active compound can 
be incorporated with excipients and used in the form of 
tablets, troches, or capsules. Oral compositions can 
also be prepared using a fluid carrier for use as a 
mouthwash, wherein the compound in the fluid carrier is 
applied orally and swished and expectorated or swallowed. 
Pharmaceutical!*/ compatible binding agents, and/or 
adjuvant materials can be included as part of the 
composition. The tablets, pills, capsules, troches and 
the like can contain any of the following ingredients, or 
compounds of a similar nature: a binder such as 
microcrystalline cellulose, gum tragacanth or gelatin; an 
excipient such as starch or lactose, a disintegrating 
agent such as alginic acid, Primogel, or corn starch; a 
lubricant such as magnesium stearate or Sterotes; a 
glidant such as colloidal silicon dioxide; a sweetening 
agent such as sucrose or saccharin; or a flavoring agent 
such as peppermint, methyl salicylate, or orange 
flavoring . 

For administration by inhalation, the compounds are 
delivered in the form of an aerosol spray from a 
pressurized container or dispenser which contains a 
suitable propellant, e.g., a gas such as carbon dioxide, 
or a nebulizer. 

Systemic administration can also be by transmucosal or 
transdermal means. For tftansmucosal or transdermal 
administration, penetrants appropriate to the barrier to 
be permeated are used in the formulation. Such 
penetrants are generally known in the art, and include, 
for example, fcr zransmucosal administration, detergents, 
bile salts, and fusidic acid derivatives. Transmucosal 
administration can be accomplished through the use of 
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preserved against the contaminating action of 
microorganisms such as bacteria and fungi. The carrier 
can be a solvent: or dispersion medium containing, for 
example, water, ethanol, polyol (for example, glycerol, 
5 propylene glycol, and liquid polyetheylene glycol, and 
the like), and suitable mixtures thereof. The proper 
fluidity can be maintained, for example, by the use of a 
coating such as lecithin, by the maintenance of the 
required particle size in the case of dispersion and by 
10 the use of surfactants. Prevention of the action of 
microorganisms can be achieved by various antibacterial 
and antifungal agents, for example, parabens, 
chlorobutanol, phenol, ascorbic acid, thimerosal, and the 
like. In many cases, it will be preferable to include 
15 isotonic agents, for example, sugars^ polyalcohols such 
as mannitol, sorbitol, sodium chloride in the 
composition. Prolonged absorption of the injectable 
compositions can be brought about by including in the 
composicion an agent which delays absorption, for 
2 0 example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by 
incorporating the active compound (e.g., a polypeptide or 
antibody) in the required amount in an appropriate 
solvent with one or a combination of ingredients 
25 enumerated above, as required, followed by filtered 

sterilization. Generally, dispersions are prepared by 
incorporating the active compound into a sterile vehicle 
which contains a basic dispersion medium and the required 
other ingredients from those enumerated above. In the 
30 case of sterile powders for the preparation of sterile 
injectable solutions, the preferred methods of 
preparation are vacuum drying and f reeze-drying which 
yields a powder of the active ingredient plus any 
additional desired ingredient from a previously sterile- 
35 filtered solution thereof. 
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the invention are dictated by and directly dependent on 
the unique characteristics of the active compound and the 
particular therapeutic effect to be achieved, and the 
limitations inherent in the art of compounding such an 
5 active compound for the treatment of individuals. 

For antibodies, the preferred dosage is 0.1 mg/kg to 
100 mg/kg of body weight (generally 10 mg/kg to 20 
mg/kg) . If the antibody is to act in the brain, a dosage 
of 50 mg/kg to 100 mg/kg is usually appropriate. 
10 Generally, partially human antibodies and fully human 

antibodies have a longer half-life within the human body 
than other antibodies. Accordingly, lower dosages and 
less frequent administration is often possible. 
Modifications such as lipidation can be used to stabilize 
15 antibodies and to enhance uptake and tissue penetration 
(e.g., into the brain) . A method for lipidation of 
antibodies is described by Cruikshank et al . ((1997) J. 
Acquired Immune Deficiency Syndromes and Human 
Retrovirology 14:193) . 
20 The nucleic acid molecules of the invention can be 
inserted into vectors and used as gene therapy vectors. 
Gene therapy vectors can be delivered to a subject by, 
for example,, intravenous injection, local administration 
(U.S. Patent 5,328,470) or by stereotactic injection 
2 5 (see, e.g., Chen et al . (1994) Proc . Natl. Acad. Sci . USA 
91:3054-3057). The pharmaceutical preparation of the 
gene therapy vector can include the gene therapy vector 
in an acceptable diluent, or can comprise a slow release 
matrix in which the gene delivery vehicle is imbedded. 
30 Alternatively, where the complete gene delivery vector 
can be produced intact from recombinant cells, e.g. 
retroviral vectors, the pharmaceutical preparation can 
include one or more cells which produce the gene delivery 
system. 
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nasal sprays or suppositories. For transdermal 
administration, the active compounds are formulated into 
ointments, salves, gels, or creams as generally known in 
the art . 

5 The compounds can also be prepared in the form of 

suppositories (e.g., with conventional suppository bases 
such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

In one embodiment, the active compounds are prepared 

10 with carriers that will protect the compound against 
rapid elimination from the body, such as a controlled 
release formulation, including implants and 
microencapsulated delivery systems. Biodegradable, 
biocompatible polymers can be used, such as ethylene 

15 vinyl acetate, polyanhydrides, polyglycolic acid, 

collagen, polyorthoesters, and polylactic acid. Methods 
for preparation of such formulations will be apparent to 
those skilled in the art. The materials can also be 
obtained commercially from Alza Corporation and Nova 

20 Pharmaceuticals, Inc. Liposomal suspensions (including 
liposomes targeted to infected cells with monoclonal 
antibodies to viral antigens) can also be used as 
pharmaceutically acceptable carriers. These can be 
prepared according to methods known to those skilled in 

25 the art, for example, as described in U.S. Patent No. 
4,522,811. 

It is especially advantageous to formulate oral or 
parenteral compositions in dosage unit form for ease of 
administration and uniformity of dosage. Dosage unit 

30 form as used herein refers to physically discrete units 
suited as unitary dosages for the subject to be treated; 
each unit containing a predetermined quantity of active 
compound calculated to produce the desired therapeutic 
effect in association with the required pharmaceutical 

35 carrier. The specification for the dosage unit forms of 
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A . Screening Assays 

The invention provides a method (also referred to 
herein as a "screening assay") for identifying 
modulators, i.e., candidate or test compounds or agents 
5 (e.g., peptides, pept idomimetics , small molecules or 
other drugs) which bind to polypeptide of the invention 
or have a stimulatory or inhibitory effect on, for 
example, expression or activity of a polypeptide of the 
invention. 

10 In one embodiment, the invention provides assays for 
screening candidate or test compounds which bind to or 
modulate the accivity of the membrane -bound form of a 
polypeptide of zhe invention or biologically active 
portion thereof. The test compounds of the present 
15 invention can be obtained using any of the numerous 

approaches in combinatorial library methods known in the 
art, including: biological libraries; spatially 
addressable parallel solid phase or solution phase 
libraries; synthetic library methods requiring 
20 deconvolution; the "one-bead one -compound" library 
method; and synthetic library methods using affinity 
chromatography selection. The biological library 
approach is limited to peptide libraries, while the other 
four approaches are applicable to peptide, non-peptide 
25 oligomer or small molecule libraries of compounds (Lam 
(1997) Anticancer Drug Des. 12:145) . 

Examples of methods for the synthesis of molecular 
libraries can be found in the art, for example in: 
DeWitt et al . (1993) Proc. Natl. Acad. Sci . USA 90:6909; 
30 Erb et al . (1994) Proc. Natl. Acad. Sci. USA 91:11422; 
Zuckermann et al. (1994). J. Med. Chem. 37:2678; Cho et 
al. (1993) Science 261:1303; Carrell et al . (1994) Angew. 
Chem. Int. Ed. Engl. 33:2059; Carell et al . (1994) Angew. 
Chew. Int. Ed. Engl- 33:2061; and Gallop et al . (1994) J. 
35 Med. Chem. 37:1233. 
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The pharmaceutical compositions can be included in a 
container, pack, or dispenser together with instructions 
for administration. 



V. Uses and Methods of the Invention 
5 The nucleic acid molecules, proteins, protein 

homologues, and antibodies described herein can be used 
in one or more of the following methods: a) screening 
assays; b) detection assays (e.g., chromosomal mapping, 
tissue typing, forensic biology) ; c) predictive medicine 

10 (e.g., diagnostic assays, prognostic assays, monitoring 
clinical trials, and pharmacogenomics) ; and d) methods of 
treatment (e.g., therapeutic and prophylactic). For 
example, polypeptides of the invention can to used to (i) 
modulate cellular proliferation; (ii) modulate cellular 

15 differentiation; and (iii) modulate cell survival. The 
isolated nucleic acid molecules of the invention can be 
used to express proteins (e.g., via a recombinant 
expression vector in a host cell in gene therapy 
applications), to detect mRNA (e.g., in a biological 

20 sample) or a genetic lesion, and to modulate activity of 
a polypeptide of the invention. In addition, the 
polypeptides of the invention can be used to screen drugs 
or compounds which modulate activity or expression of a 
polypeptide of the invention as well as to treat 

25 disorders characterized by insufficient or excessive 

production of a protein of the invention or production 
of a form of a protein of the invention which has 
decreased or aberrant activity compared to the wild type 
protein. In addition, the antibodies of the invention 

3 0 can be used to detect and isolate a protein of the and 
modulate activity of a protein of the invention. 

This invention further pertains to novel agents 
identified by the above-described screening assays and 
uses thereof for treatments as described herein. 
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invention, or a biologically active portion thereof, on 
the cell surface with a known compound which binds the 
polypeptide to form an assay mixture, contacting the 
assay mixture with a test compound, and determining the 
5 ability of the test compound to interact with the 

polypeptide, wherein determining the ability of the test 
compound to interact with the polypeptide comprises 
determining the ability of the test compound to 
preferentially bind to the polypeptide or a biologically 
10 active portion thereof as compared to the known compound. 

In another embodiment, an assay is a cell -based assay 
comprising contacting a cell expressing a membrane -bound 
form of a polypeptide of the invention, or a biologically 
active portion thereof, on the cell surface with a test 
15 compound and determining the ability of the test compound 
to modulate (e.g., stimulate or inhibit) the activity of 
the polypeptide or biologically active portion thereof. 
Determining the ability of the test compound to modulate 
the activity of the polypeptide or a biologically active 
20 portion thereof can be accomplished, for example, by 
determining the ability of the polypeptide protein to 
bind to or interact with a target molecule. 

Determining the ability of a polypeptide of the 
invention to bind to or interact with a target molecule 
25 can be accomplished by one of the methods described above 
for determining direct binding. As used herein, a 
"target molecule" is a molecule with which a selected 
polypeptide (e.g., a polypeptide of the invention binds 
or interacts with in nature, for example, a molecule on 
30 the surface of a cell which expresses the selected 

protein, a molecule on the surface of a second cell, a 
molecule in the extracellular milieu, a molecule 
associated with the internal surface of a cell membrane 
or a cytoplasmic molecule. A target molecule can be a 
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Libraries of compounds may be presented in solution 
(e.g., Houghten (1992) Bio/Techniques 13:412-421), or on 
beads (Lam (1991) Nature 354 : 82-84) , chips (Fodor (1993) 
Nature 364:555-556), bacteria (U.S. Patent No. 
5 5,223,409), spores (Patent NOS. 5,571,698; 5,403,484; and 
5,223,409), plasmids (Cull et al . (1992) Proc. Natl. 
Acad, Sci. USA 89:1865-1869) or phage (Scott and Smith 
(1990) Science 249^386-390; Devlin (1990) Science 
249:404-406; Cwirla et al . (1990) Proc. Natl. Acad. Sci. 
10 USA 87:6378-6382; and Felici (1991) J. Mol . Biol. 
222 :301-310) . 

In one embodiment, an assay is a cell-based assay in 
which a cell which expresses a membrane- bound form of a 
polypeptide of the invention, or a biologically active 

15 portion thereof, on the cell surface is contacted with a 
test compound and the ability of the test compound to 
bind to the polypeptide determined. The cell, for 
example, can be a yeast cell or a cell of mammalian 
origin. Determining the ability of the test compound to 

20 bind to the polypeptide can be accomplished, for example, 
by coupling the test compound with a radioisotope or 
enzymatic label such that binding of the test compound to 
the polypeptide or biologically active portion thereof 
can be determined by detecting the labeled compound in a 

2 5 complex. For example, test compounds can be labeled with 

125 I, 35 S, 14 C, or 3 H, either directly or indirectly, and 
the radioisotope detected by direct counting of 
radioemmission or by scintillation counting. 
Alternatively, test compounds can be enzymatically 

3 0 labeled with, for example, horseradish peroxidase, 

alkaline phosphatase, or luciferase, and the enzymatic 
label detected by determination of conversion of an 
appropriate substrate to product. In a preferred 
embodiment, the assay comprises contacting a cell which 
3 5 expresses a membrane -bound form of a polypeptide of the 
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polypeptide to form an assay mixture, contacting the 
assay mixture with a test compound, and determining the 
ability cf the test compound to interact with the 
polypeptide, wherein determining the ability of the test 
5 compound to interact with the polypeptide comprises 
determining the ability of the test compound to 
preferentially bind to the polypeptide or biologically 
active portion thereof as compared to the known compound. 
In another embodiment, an assay is a cell -free assay 
0 comprising contacting a polypeptide of the invention or 
biologically active portion thereof with a test compound 
and determining the ability of the test compound to 
modulate (e.g., stimulate or inhibit) the activity of the 
polypeptide or biologically active portion thereof. 
5 Determining the ability of the test compound to modulate 
the activity of the polypeptide can be accomplished, for 
example, by determining the ability of the polypeptide to 
bind to a target molecule by one of the methods described 
above for determining direct binding. In an alternative 
0 embodiment, determining the ability of the test compound 
to modulate the activity of the polypeptide can be 
accomplished by determining the ability of the 
polypeptide of the invention to further modulate the 
target molecule. For example, the catalytic/enzymatic 
5 activity of the target molecule on an appropriate 
substrate can be determined as previously described. 

In yet another embodiment, the cell -free assay 
comprises contacting a polypeptide of the invention or 
biologically active portion thereof with a known compound 
0 which binds the polypeptide to form an assay mixture, 
contacting the assay mixture with a test compound, and 
determining the ability of the test compound to interact 
with the polypeptide, wherein determining the ability of 
the test compound to interact with the polypeptide 
5 comprises determining the ability of the polypeptide to 
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polypeptide of the invention or some other polypeptide or 
protein. For example, a target molecule can be a 
component of a signal transduction pathway which 
facilitates transduction of an extracellular signal 
5 (e.g., a signal generated by binding of a compound to a 
polypeptide of the invention) through the cell membrane 
and into the cell or a second intercellular protein which 
has catalytic activity or a protein which facilitates the 
association of downstream signaling molecules with a 

10 polypeptide of the invention. Determining the ability of 
a polypeptide of the invention to bind to or interact 
with a target molecule can be accomplished by determining 
the activity of the target molecule. For example, the 
activity of the target molecule can be determined by 

15 detecting induction of a cellular second messenger of the 
target (e.g., intracellular Ca 2 \ diacylglycerol , IP3, 
etc.), detecting catalytic/enzymatic activity of the 
target on an appropriate substrate, detecting the 
induction of a reporter gene (e.g., a regulatory element 

20 that is responsive to a polypeptide of the invention 

operably linked to a nucleic acid encoding a detectable 
marker, e.g. luciferase) , or detecting a cellular 
response, for example, cellular differentiation, or cell 
proliferation. 

25 In yet another embodiment, an assay of the present 
invention is a cell -free assay comprising contacting a 
polypeptide of the invention or biologically active 
portion thereof with a test compound and determining the 
ability of the test compound to bind to the polypeptide 

30 or biologically active portion thereof. Binding of the 
test compound to the polypeptide can be determined either 
directly or indirectly as described above. In a 
preferred embodiment, the assay includes contacting the 
polypeptide of the invention or biologically active 

3 5 portion thereof with a known compound which binds the 
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Chemical; St. Louis, MO) or glutathione derivatized 
microtitre plates, which are then combined with the test 
compound or the test compound and either the non- adsorbed 
target protein or A polypeptide of the invention, and the 
5 mixture incubated under conditions conducive to complex 
formation (e.g., at physiological conditions for salt and 
pH) . Following incubation, the beads or microtitre plate 
wells are washed to remove any unbound components and 
complex formation is measured either directly or 
10 indirectly, for example, as described above. 

Alternatively, the complexes can be dissociated from the 
matrix, and the level of binding or activity of the 
polypeptide of the invention can be determined using 
standard techniques. 
15 Other techniques for immobilizing proteins on matrices 
can also be used in the screening assays of the 
invention. For example, either the polypeptide of the 
invention or its target molecule can be immobilized 
utilizing conjugation of biotin and streptavidin . 
20 Biotinyiated polypeptide of the invention or target 
molecules can be prepared from biotin-NHS (N-hydroxy- 
succinimide) using techniques well known in the art 
(e.g., biotinylation kit, Pierce Chemicals; Rockford, 
IL), and immobilized in the wells of streptavidin-coated 
25 96 well plates (Pierce Chemical) . Alternatively, 

antibodies reactive with the polypeptide of the invention 
or target molecules but which do not interfere with 
binding of the polypeptide of the invention to its target 
molecule can be derivatized to the wells of the plate, 
30 and unbound target or polypeptidede of the invention 
trapped in the wells by antibody conjugation. Methods 
for detecting such complexes, in addition to those 
described above for the GST -immobilized complexes, 
include immunodetection of complexes using antibodies 
35 reactive with the polypeptide of the invention or target 
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preferentially bind to or modulate the activity of a 
target molecule. 

The cell -free assays of the present invention are 
amenable to use of both a soluble form or the membrane- 
5 bound form of a polypeptide of the invention. in the 
case of cell-free assays comprising the membrane -bound 
form of the polypeptide, it may be desirable to utilize a 
solubilizing agent such that the membrane -bound form of 
the polypeptide is maintained in solution. Examples of 

10 such solubilizing agents include non- ionic detergents 
such as n-octylglucoside, n-dodecylglucoside , n- 
dodecylmaltoside, octanoyl-N-methylglucamide , decanoyl-N- 
methylglucamide, Triton X-100, Triton X-114, Thesit, 
Isotridecypoly (ethylene glycol ether) n, 3-[(3- 

15 cholamidopropyl) dimethylamminio] -1-propane sulfonate 
(CHAPS) , 3 - [ (3 -cholamidopropyl) dimethylamminio] -2- 
hydroxy- 1-propane sulfonate (CHAPSO) , or N-dodecyl=N, N- 
dimethyl -3 -ammonio- 1-propane sulfonate . 

In more than one embodiment of the above assay methods 

20 of the present invention, it may be desirable to 

immobilize either the polypeptide of the invention or its 
target molecule to facilitate separation of complexed 
from uncomplexed forms of one or both of the proteins, as 
well as to accommodate automation of the assay. Binding 

25 of a test compound to the polypeptide, or interaction of 
the polypeptide with a target molecule in the presence 
and absence of a candidate compound, can be accomplished 
in any vessel suitable for containing the reactants. 
Examples of such vessels include microtitre plates, test 

30 tubes, and micro-centrifuge tubes. In one embodiment, a 
fusion protein can be provided which adds a domain that 
allows one or both of the proteins to be bound to a 
matrix. For example, glutathione-S- transferase fusion 
proteins or glutathione-S- transferase fusion proteins can 

3 5 be adsorbed onto glutathione sepharose beads (Sigma 
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No. WO 94/10300), to identify other proteins, which bind 
to or interact with the polypeptide of the invention and 
modulate activity of the polypeptide of the invention. 
Such binding proteins are also likely to be involved in 
5 the propagation of signals by the polypeptide of the 
inventions as, for example, upstream or downstream 
elements of a signaling pathway involving the polypeptide 
of the invention. 

This invention further pertains to novel agents 
10 identified by the above-described screening assays and 
uses thereof for treatments as described herein. 

B . Detection Assays 

Portions or fragments of the cDNA sequences identified 
herein (and the corresponding complete gene sequences) 

15 can be used in numerous ways as polynucleotide reagents. 
For example, these sequences can be used to: (i) map 
their respective genes on a chromosome and, thus, locate 
gene regions associated with genetic disease; (ii) 
identify an individual from a minute biological sample 

20 (tissue typing) ; and (iii) aid in forensic identification 
of a biological sample. These applications are described 
in the subsections below. 

1 . Chromosome Mapping 

Once the sequence (or a portion of the sequence) of a 
2 5 gene has been isolated, this sequence can be used to map 
the location of the gene on a chromosome. Accordingly, 
nucleic acid molecules described herein or fragments 
thereof, can be used to map the location of the 
corresponding genes on a chromosome. The mapping of the 
30 sequences to chromosomes is an important first step in 
correlating these sequences with genes associated with 
disease . 
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molecule, as well as enzyme - linked assays which rely on 
detecting an enzymatic activity associated with the 
polypeptide of the invention or target molecule. 

In another embodiment, modulators of expression of a 
5 polypeptide of the invention are identified in a method 
in which a cell is contacted with a candidate compound 
and the expression of the selected mRNA or protein (i.e., 
the mRNA or protein corresponding to a polypeptide or 
nucleic acid of the invention) in the cell is determined. 

10 The level of expression of the selected mRNA or protein 
in the presence of the candidate compound is compared to 
the level of expression of the selected mRNA or protein 
in the absence of the candidate compound. The candidate 
compound can then be identified. as a modulator of 

15 expression of the polypeptide of the invention based on 
this comparison. For example, when expression of the 
selected mRNA or protein is greater (statistically 
significantly greater) in the presence of the candidate 
compound than in its absence, the candidate compound is 

20 identified as a stimulator of the selected mRNA or 

protein expression. Alternatively, when expression of 
the selected mRNA or protein is less (statistically 
significantly less) in the presence of the candidate 
compound than in its absence, the candidate compound is 

25 identified as an inhibitor of the selected mRNA or 

protein expression. The level of the selected mRNA or 
protein expression in the cells can be determined by 
methods described herein. 

In yet another aspect of the invention, a polypeptide 

30 of the inventions can be used as "bait proteins" in a 
two-hybrid assay or three hybrid assay (see, e.g., U.S. 
Patent No. 5,283,317; Zervos et al . (1993) Cell 72:223- 
232; Madura et al . (1993) J\ Biol. Chem. 268:12046-12054; 
Bartel et al . (1993) Bio/Techniques 14:920-924; Iwabuchi 

35 e: al . (1993) Oncogene 8:1693-1696; and PCT Publication 
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marking multiple sites and/or multiple chromosomes. 
Reagents corresponding to noncoding regions of the genes 
actually are preferred for mapping purposes. Coding 
sequences are more likely to be conserved within gene 
5 families, thus increasing the chance of cross 
hybridizations during chromosomal mapping. 

Once a sequence has been mapped to a precise 
chromosomal location, the physical position of the 
sequence on the chromosome can be correlated with genetic 

10 map data. (Such data are found, for example, in V. 

McKusick, Mendelian Inheritance in Man, available on-line 
through Johns Hopkins University Welch Medical Library) . 
The relationship between genes and disease, mapped to the 
same chromosomal region, can then be identified through 

15 linkage analysis (co- inheritance of physically adjacent 
genes), described in, e.g., Egeland et al . (1987) Mature 
325 :783-787. 

Moreover, differences in the DNA sequences between 
individuals affected and unaffected with a disease 

20 associated with a gene of the invention can be 

determined. If a mutation is observed in some or all of 
the affected individuals but not in any unaffected 
individuals, then the mutation is likely to be the 
causative agent of the particular disease. Comparison of 

25 affected and unaffected individuals generally involves 
first looking for structrural alterations in the 
chromosomes such as deletions or translocations that are 
visible from chromosome spreads or detectable using PCR 
based on that DNA sequence. Ultimately, complete 

3 0 sequencing of genes from several individuals can be 

performed to confirm the presence of a mutation and to 
distinguish mutations from polymorphisms. 
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Briefly, genes can be mapped to chromosomes by 
preparing PCR primers {preferably 15-2 5 bp in length) 
from the sequence of a gene of the invention. Computer 
analysis of the sequence of a gene of the invention can 
5 be used to rapidly select primers that do not span more 
than one exon in the genomic DNA, thus complicating the 
amplification process. These primers can then be used 
for PCR screening of somatic cell hybrids containing 
individual human chromosomes . Only those hybrids 

10 containing the human gene corresponding to the gene 

sequences will yield an amplified fragment. For a review 
of this technique, see D'Eustachio et al . ((1983) Science 
220 : 919-924) . 

PCR mapping of somatic cell hybrids is a rapid 

15 procedure for assigning a particular sequence to a 

particular chromosome. Three or more sequences can be 
assigned per day using a single thermal cycler. Using 
the nucleic acid sequences of the invention to design 
oligonucleotide primers, sublocalization can be achieved 

2 0 with panels of fragments from specific chromosomes. 

Other mapping strategies which can similarly be used to 
map a gene to its chromosome include in situ 
hybridization (described in Fan et al . (1990) Proc. Natl. 
Acad. Sci. USA 87:6223-27), pre- screening with labeled 
25 flow-sorted chromosomes (CITE) / and pre-selection by 
hybridization to chromosome specific cDNA libraries 
(CITE) . Fluorescence in situ hybridization (FISH) of a 
DNA sequence to a metaphase chromosomal spread can 
further be used to provide a precise chromosomal location 

3 0 in one step. For a review of this technique, see Verma 

et; al . , (Human Chromosomes: A Manual of Basic Techniques 
(Pergamon Press, New York, 1988)). 

Reagents for chromosome mapping can be used 
individually to mark a single chromosome or a single site 
35 on that chromosome, or panels of reagents can be used for 
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individual humans occurs with a frequency of about once 
per each 500 bases. Each of the sequences described 
herein can, to some degree, be used as a standard against 
which DNA from an individual can be compared for 
5 identification purposes. Because greater numbers of 
polymorphisms occur in the noncoding regions, fewer 
sequences are necessary to differentiate individuals. 
The noncoding sequences of SEQ ID NO:l can comfortably 
provide positive individual identification with a panel 

10 of perhaps 10 to 1,000 primers which each yield a 

noncoding amplified sequence of 100 bases. If predicted 
coding sequences, such as those in SEQ ID NO: 3 are used, 
a more appropriate number of primers for positive 
individual identification would be 500-2,000. 

15 If a panel of reagents from the nucleic acid sequences 
described herein is used to generate a unique 
identification database for an individual, those same 
reagents can later be used to identify tissue from that 
individual. Using the unique identification database, 

20 positive identification of the individual, living or 
dead, can be made from extremely small tissue samples. 

3. Use of Partial Gene Sequences in Forensic Biology 
DNA-based identification techniques can also be used in 
forensic biology. Forensic biology is a scientific field 

25 employing genetic typing of biological evidence found at 
a crime scene as a means for positively identifying, for 
example, a perpetrator of a crime. To make such an 
identification, PCR technology can be used to amplify DNA 
sequences taken from very small biological samples such 

30 as tissues, e.g., hair or skin, or body fluids, e.g., 
blood, saliva, or semen found at a crime scene. The 
amplified sequence can then be compared to a standard, 
thereby allowing identification of the origin of the 
biological sample. 
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2. Tissue Typing 

The nucleic acid sequences of the present invention can 
also be used to identify individuals from minute 
biological samples. The United States military, for 
5 example, is considering the use of restriction fragment 
length polymorphism (RFLP) for identification of its 
personnel. In this technique, an individual's genomic 
DNA is digested with one or more restriction enzymes, and 
probed on a Southern blot to yield unique bands for 

10 identification. This method does not suffer from the 
current limitations of "Dog Tags" which can be lost, 
switched, or stolen, making positive identification 
difficult. The sequences of the present invention are 
useful as additional DNA markers for RFLP (described in 

15 U.S. Patent 5,272,057). 

Furthermore, the sequences of the present invention can 
be used to provide an alternative technique which 
determines the actual base-by-base DNA sequence of 
selected portions of an individual's genome. Thus, the 

20 nucleic acid sequences described herein can be used to 
prepare two PGR primers from the 5' and 3' ends of the 
sequences. These primers can then be used to amplify an 
individual's DNA and subsequently sequence it. 

Panels of corresponding DNA sequences from individuals, 

2 5 prepared in this manner, can provide unique individual 
identifications, as each individual will have a unique 
set of such DNA sequences due to allelic differences. 
The sequences of the present invention can be used to 
obtain such identification sequences from individuals and 

30 from tissue. The nucleic acid sequences of the invention 
uniquely represent portions of the human genome. Allelic 
variation occurs to some degree in the coding regions of 
these sequences, and to a greater degree in the noncoding 
regions. It is estimated that allelic variation between 
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to diagnostic assays for determining expression of a 
polypeptide or nucleic acid of the invention and/or 
activity of a polypeptide of the invention, in the 
context of a biological sample (e.g., blood, serum, 
5 cells, tissue) to thereby determine whether an individual 
is afflicted with a disease or disorder, or is at risk of 
developing a disorder, associated with aberrant 
expression or activity of a polypeptide of the invention. 
The invention also provides for prognostic (or 

10 predictive) assays for determining whether an individual 
is at risk of developing a disorder associated with 
aberrant expression or activity of a polypeptide of the 
invention. For example, mutations in a gene of the 
invention can be assayed in a biological sample. Such 

15 assays can be used for prognostic or predictive purpose 
to thereby prophylactically treat an individual prior to 
the onset of a disorder characterized by or associated 
with aberrant expression or activity of a polypeptide of 
the invention. 

20 Another aspect of the invention provides methods for 
expression of a nucleic acid or polypeptide of the 
invention or accivity of a polypeptide of the invention 
in an individual co thereby select appropriate 
therapeutic or prophylactic agents for that individual 

25 (referred to herein as "pharmacogenomics" ) . 

Pharmacogenomics allows for the selection of agents 
(e.g., drugs) for therapeutic or prophylactic treatment 
of an individual based on the genotype of the individual 
(e.g., the genotype of tha individual examined to 

30 determine the ability of the individual to respond to a 
particular agent) . 

Yet another aspect of the invention pertains to 
monitoring the influence of agents (e.g., drugs or other 
compounds) on the expression or activity of a polypeptide 

35 of the invention in clinical trials. 
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The sequences of the present invention can be used to 
provide polynucleotide reagents, e.g., PCR primers, 
targeted to specific loci in the human genome, which can 
enhance the reliability of DNA-based forensic 
5 identifications by, for example, providing another 

"identification marker" (i.e. another DNA sequence that 
is unique to a particular individual) . As mentioned 
above, actual base sequence information can be used for 
identification as an accurate alternative to patterns 

10 formed by restriction enzyme generated fragments. 

Sequences targeted to noncoding regions are particularly 
appropriate for this use as greater numbers of 
polymorphisms occur in the noncoding regions, making it 
easier to differentiate individuals using this technique. 

15 Examples of polynucleotide reagents include the nucleic 
acid sequences of the invention or portions thereof, 
e.g., fragments derived from noncoding regions having a 
length of at least 20 or 30 bases. 

The nucleic acid sequences described herein can further 

20 be used to provide polynucleotide reagents, e.g., labeled 
or labelable probes which can be used in, for example, an 
in situ hybridization technique, to identify a specific 
tissue, e.g., brain tissue. This can be very useful in 
cases where a forensic pathologist is presented with a 

25 tissue of unknown origin. Panels of such probes can be 
used to identify tissue by species and/or by organ type. 

C. Predictive Medicine 

The present invention also pertains to the field of 
predictive medicine in which diagnostic assays, 
3 0 prognostic assays, pharmacogenomics, and monitoring 
clinical trails are used for prognostic (predictive) 
purposes to thereby treat an individual prophylactically . 
Accordingly, one aspect of the present invention relates 
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indirect labeling of the probe or antibody by reactivity 
with another reagent that is directly labeled. Examples 
of indirect labeling include detection of a primary 
antibody using a f luorescently labeled secondary antibody 
5 and end-labeling of a DNA probe with biotin such that it 
can be detected with f luorescently labeled streptavidin. 
The term "biological sample" is intended to include 
tissues, cells and biological fluids isolated from a 
subject, as well as tissues, cells and fluids present 

10 within a subject. That is, the detection method of the 
invention can be used to detect mRNA, protein, or genomic 
DNA in a biological sample in vitro as well as in vivo. 
For example, in vitro techniques for detection of mRNA 
include Northern hybridizations and in situ 

15 hybridizations. In vitro techniques for detection of A 
polypeptide of the invention include enzyme linked 
immunosorbent assays (ELISAs) , Western blots, 
immunoprecipitat ions and immunofluorescence. In vitro 
techniques for detection of genomic DNA include Southern 

20 hybridizations. Furthermore, in vivo techniques for 
detection of a polypeptide of the invention include 
introducing into a subject a labeled antibody directed 
against the polypeptide. For example, the antibody can 
be labeled with a radioactive marker whose presence and 

25 location in a subject can be detected by standard imaging 
techniques . 

In one embodiment, the biological sample contains 
protein molecules from the test subject. Alternatively, 
the biological sample can contain mRNA molecules from the 
3 0 test subject or genomic DNA molecules from the test 

subject. A preferred biological sample is a peripheral 
blood leukocyte sample isolated by conventional means 
from a subject . 

In another embodiment, the methods further involve 
3 5 obtaining a control biological sample from a control 
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These and other agents are described in further detail 
in the following sections. 



1 . Diagnostic Assays 

An exemplary method for detecting the presence or 
5 absence of a polypeptide or nucleic acid of the invention 
in a biological sample involves obtaining a biological 
sample from a test subject and contacting the biological 
sample with a compound or an agent capable of detecting a 
polypeptide or nucleic acid (e.g., mRNA, genomic DNA) of 

10 the invention such that the presence of a polypeptide or 
nucleic acid of the invention is detected in the 
biological sample. A preferred agent for detecting mRNA 
or genomic DNA encoding a polypeptide of the invention is 
a labeled nucleic acid probe capable of hybridizing to 

15 mRNA or genomic DNA encoding a polypeptide of the 

invention. The nucleic acid probe can be, for example, a 
full-length cDNA, such as the nucleic acid of SEQ ID N0:1 
or 4, or a portion thereof, such as an oligonucleotide of 
at least 15, 30, 50, 100, 250 or 500 nucleotides in 

20 length and sufficient to specifically hybridize under 

stringent conditions to a mRNA or genomic DNA encoding a 
polypeptide of the invention. Other suitable probes for 
use in the diagnostic assays of the invention are 
described herein. 

25 A preferred agent for detecting A polypeptide of the 
invention is an antibody capable of binding to A 
polypeptide of the invention, preferably an antibody with 
a detectable label. Antibodies can be polyclonal, or 
more preferably, monoclonal. An intact antibody, or a 

3 0 fragment thereof (e.g., Fab or F(ab') 2 > can be used. The 
term "labeled", with regard to the probe or antibody, is 
intended to encompass direct labeling of the probe or 
antibody by coupling (i.e., physically linking) a 
detectable substance to the probe or antibody, as well as 
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For oligonucleotide-based kits, the kit may comprise, 
for example: (1) an oligonucleotide, e.g., a detectably 
labeled oligonucleotide, which hybridizes to a nucleic 
acid sequence encoding a polypeptide of the invention or 
5 (2) a pair of primers useful for amplifying a nucleic 
acid molecule encoding a polypeptide of the invention. 

The kit may also comprise, e.g., a buffering agent, a 
preservative, or a protein stabilizing agent. The kit 
may also comprise components necessary for detecting the 

10 detectable agent (e.g., an enzyme or a substrate). The 
kit may also contain a control sample or a series of 
control samples which can be assayed and compared to the 
test sample contained. Each component of the kit is 
usually enclosed within an individual container and all 

15 of the various containers are within a single package 

along with instructions for observing whether the tested 
subject is suffering from or is at risk of developing a 
disorder associated with aberrant expression of the 
polypeptide. 

2 0 2 . Prognostic Assays 

The methods described herein can furthermore be 
utilized as diagnostic or prognostic assays to identify 
subjects having or at risk of developing a disease or 
disorder associated with aberrant expression or activity 

2 5 of a polypeptide of the invention. For example, the 

assays described herein, such as the preceding diagnostic 
assays or the following assays, can be utilized to 
identify a subject having or at risk of developing a 
disorder associated with aberrant expression or activity 

30 of a polypeptide of the invention. Alternatively, the 
prognostic assays can be utilized to identify a subject 
having or at risk for developing such a disease or 
disorder. Thus, the present invention provides a method 
in which a test sample is obtained from a subject and a 
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subject, contacting the control sample with a compound or 
agent capable of detecting a polypeptide of the invention 
or mRNA or genomic DNA encoding a polypeptide of the 
invention, such that the presence of the polypeptide or 
5 mRNA or genomic DNA encoding the polypeptide is detected 
in the biological sample, and comparing the presence of 
the polypeptide or mRNA or genomic DNA encoding the 
polypeptide in the control sample with the presence of 
the polypeptide or mRNA or genomic DNA encoding the 

10 polypeptide in the test sample. 

The invention also encompasses kits for detecting the 
presence of a polypeptide or nucleic acid of the 
invention in a biological sample (a test sample) . Such 
kits can be used to determine if a subject is suffering 

15 from or is at increased risk of developing a disorder 
associated with aberrant expression of a polypeptide of 
the invention (e.g., an immunological disorder). For 
example, the kit can comprise a labeled compound or agent 
capable of detecting the polypeptide or mRNA encoding the 

2 0 polypeptide in a biological sample and means for 

determining the amount of the polypeptide or mRNA in the 
sample (e.g., an antibody which binds the polypeptide or 
an oligonucleotide probe which binds to DNA or mRNA 
encoding the polypeptide) . Kits may also include 
25 instruction for observing that the tested subject is 
suffering from or is at risk of developing a disorder 
associated with aberrant expression of the polypeptide if 
the amount of the polypeptide or mRNA encoding the 
polypeptide is above or below a normal level. 

3 0 For antibody-based kits, the kit may comprise, for 

example: (1) a first antibody (e.g., attached to a solid 
support) which binds to a polypeptide of the invention; 
and, optionally, (2) a second, different antibody which 
binds to either the polypeptide or the first antibody and 
3 5 is conjugated to a detectable agent. 
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In preferred embodiments, the methods include detecting, 
in a sample of cells from the subject, the presence or 
absence of a genetic lesion or mutation characterized by 
at least one of an alteration affecting the integrity of 
a gene encoding the polypeptide of the invention, or the 
mis-expression of the gene encoding the polypeptide of 
the invention. For example, such genetic lesions or 
mutations can be detected by ascertaining the existence 
of at least one of: Da deletion of one or more 
nucleotides from the gene; 2) an addition of one or more 
nucleotides to the gene; 3) a substitution of one or more 
nucleotides of the gene; 4) a chromosomal rearrangement 
of the gene; 5) an alteration in the level of a messenger 
RNA transcript of the gene; 6) an aberrant modification 
of the gene, such as of the methylation pattern of the 
genomic DNA; 7) the presence of a non-wild type splicing 
pattern of a messenger RNA transcript of the gene; 8) a 
non-wild type level of a the protein encoded by the gene; 
9) an allelic loss of the gene; and 10) an inappropriate 
post-translational modification of the protein encoded by 
the gene. As described herein, there are a large number 
of assay techniques known in the art which can be used 
for detecting lesions in a gene. 

In certain embodiments, detection of the lesion 
involves the use of a probe/primer in a polymerase chain 
reaction (PCR) (see, e.g., U.S. Patent Nos . 4,683,195 and 
4,683,202), such as anchor PCR or RACE PCR, or, 
alternatively, in a ligation chain reaction (LCR) (see, 
e.g., Landegran et al . (1988) Science 241:1077-1080; and 
Nakazawa et al . (1994) Proc. Natl. Acad. Sci . USA 91:360- 
364), the latter of which can be particularly useful for 
detecting point mutations in a gene (see, e.g., Abravaya 
ec al. (1995) Nucleic Acids Res. 23:675-682). This 
method can include the steps of collecting a sample of 
cells from a patient, isolating nucleic acid (e.g., 
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polypeptide or nucleic acid (e.g., mRNA, genomic DNA) of 
the invention is detected, wherein the presence of the 
polypeptide or nucleic acid is diagnostic for a subject 
having or at risk of developing a disease or disorder 
5 associated with aberrant expression or activity of the 
polypeptide. As used herein, a "test sample" refers to a 
biological sample obtained from a subject of interest. 
For example, a test sample can be a biological fluid 
(e.g., serum), cell sample, or tissue. 

10 Furthermore, the prognostic assays described herein can 
be used to determine whether a subject can be 
administered an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small 
molecule, or other drug candidate) to treat a disease or 

15 disorder associated with aberrant expression or activity 
of a polypeptide of the invention. For example, such 
methods can be used to determine whether a subject can be 
effectively treated with a specific agent or class of 
agents (e.g., agents of a type which decrease activity of 

2 0 the polypeptide) . Thus, the present invention provides 
methods for determining whether a subject can be 
effectively treated with an agent for a disorder 
associated with aberrant expression or activity of a 
polypeptide of the invention in which a test sample is 

2 5 obtained and the polypeptide or nucleic acid encoding the 

polypeptide is detected (e.g., wherein the presence of 
the polypeptide or nucleic acid is diagnostic for a 
subject that can be administered the agent to treat a 
disorder associated with aberrant expression or activity 

3 0 of the polypeptide) . 

The methods of the invention can also be used to detect 
genetic lesions or mutations in a gene of the invention, 
thereby determining if a subject with the lesioned gene 
is at risk for a disorder characterized aberrant 
3 5 expression or activity of a polypeptide of the invention. 



WO 00/18800 



PCTAJS99/22818 



- 88 - 

specific mutations by development or loss of a ribozyme 
cleavage site. 

In other embodiments, genetic mutations can be 
identified by hybridizing a sample and control nucleic 

5 acids, e.g., DNA or RNA, to high density arrays 

containing hundreds or thousands of oligonucleotides 
probes (Cronin et al . (1996) Human Mutation 7:244-255; 
Kozal et al . (1996) Nature Medicine 2:753-759). For 
example, genetic mutations can be identified in two- 

0 dimensional arrays containing light -generated DNA probes 
as described in Cronin et al . , supra. Briefly, a first 
hybridization array of probes can be used to scan through 
long stretches of DNA in a sample and control to identify 
base changes between the sequences by making linear 

5 arrays of sequential overlapping probes. This step 

allows the identification of point mutations. This step 
is followed by a second hybridization array that allows 
the characterization of specific mutations by using 
smaller, specialized probe arrays complementary to all 

0 variants or mutations detected. Each mutation array is 
composed of parallel probe sets, one complementary to the 
wild- type gene and the other complementary to the mutant 
gene . 

In yet another embodiment, any of a variety of 
5 sequencing reactions known in the art can be used to 
directly sequence the selected gene and detect mutations 
by comparing, the sequence of the sample nucleic acids 
with the corresponding wild- type (control) sequence. 
Examples of sequencing reactions include those based on 
0 techniques developed by Maxim and Gilbert ((1977) Proc. 
Natl. Acad. Sci. USA 74:560) or Sanger ((1977) Proc. 
Natl. Acad. Sci. USA 74:5463). It is also contemplated 
that any of a variety of automated sequencing procedures 
can be utilized when performing the diagnostic assays 
5 ((1995) Bio/Techniques 19:448), including sequencing by 
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genomic, mRNA or both) from the cells of the sample, 
contacting the nucleic acid sample with one or more 
primers which specifically hybridize to the selected gene 
under conditions such that hybridization and 
5 amplification of the gene (if present) occurs, and 
detecting the presence or absence of an amplification 
product, or detecting the size of the amplification 
product and comparing the length to a control sample. It 
is anticipated that PCR and/or LCR may be desirable to 

10 use as a preliminary amplification step in conjunction 
with any of the techniques used for detecting mutations 
described herein. 

Alternative amplification methods include: self 
sustained sequence replication (Guatelli et al . (1990) 

15 Proc. Natl. Acad. Sci. USA 87:1874-1878), transcriptional 
amplification system (Kwoh, et al . (1989) Proc. Natl. 
Acad. Sci. USA 86:1173-1177), Q-Beta Replicase (Lizardi 
et al. (1988) Bio/Technology 6:1197), or any other 
nucleic acid amplification method, followed by the 

20 detection of the amplified molecules using techniques 

well known to those of skill in the art. These detection 
schemes are especially useful for the detection of 
nucleic acid molecules if such molecules are present in 
very low numbers . 

25 In an alternative embodiment, mutations in a selected 
gene from a sample cell can be identified by alterations 
in restriction enzyme cleavage patterns. For example, 
sample and control DNA is isolated, amplified 
(optionally) , digested with one or more restriction 

30 endonucleases, and fragment length sizes are determined 
by gel electrophoresis and compared. Differences in 
fragment length sizes between sample and control DNA 
indicates mutations in the sample DNA. Moreover, the use 
of sequence specific ribozymes (see, e.g., U.S. Patent 

35 No. 5,498,531) can be used to score for the presence of 
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E. coli cleaves A at G/A mismatches and the thymidine DNA 
glycosylase from HeLa cells cleaves T at G/T mismatches 
(Hsu et al. (1994) Carcinogenesis 15:1657-1662). 
According to an exemplary embodiment, a probe based on a 
5 selected sequence, e.g., a wild-type sequence, is 

hybridized to a cDNA or other DNA product from a test 
cell(s) . The duplex is treated with a DNA mismatch 
repair enzyme, and the cleavage products, if any, can be 
detected from electrophoresis protocols or the like. 

10 See, e.g., U.S. Patent No. 5,459,039. 

In other embodiments, alterations in electrophoretic 
mobility will be used to identify mutations in genes. 
For example, single strand conformation polymorphism 
(SSCP) may be used to detect differences in 

15 electrophoretic mobility between mutant and wild type 

nucleic acids (Orita et al . (1989) Proc. Natl. Acad. Sci . 
USA 86:2766; see also Cotton (1993) Mutat. Res. 285:125- 
144/ Hayashi (1992) Genet. Anal. Tech. Appl . 9:73-79). 
Single -stranded DNA fragments of sample and control 

20 nucleic acids will be denatured and allowed to renature . 
The secondary structure of single-stranded nucleic acids 
varies according to sequence, and the resulting 
alteration in electrophoretic mobility enables the 
detection of even a single base change. The DNA 

25 fragmencs may be labeled or detected with labeled probes. 
The sensitivity of the assay may be enhanced by using RNA 
(rather than DNA) , in which the secondary structure is 
more sensitive to a change in sequence. In a preferred 
embodiment, the subject method utilizes heteroduplex 

3 0 analysis to separate double stranded heteroduplex 

molecules on the basis of changes in electrophoretic 
mobility (Keen et al . (1991) Trends Genet. 7:5). 

In yet another embodiment, the movement of mutant or 
wild- type fragments in polyacrylamide gels containing a 

3 5 gradient of denaturant is assayed using denaturing 
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mass spectrometry (see, e.g., PCT Publication No. WO 
94/16101; Cohen et al . (1996) Adv. Chromatogrr . 36:127- 
162; and Griffin et al . (1993) Appl . Biochem. Biotechnol . 
38:147-159) . 

5 Other methods for detecting mutations in a selected 
gene include methods in which protection from cleavage 
agents is used to detect mismatched bases in RNA/RNA or 
RNA/DNA heteroduplexes (Myers et al . (1985) Science 
230:1242). In general, the technique of "mismatch 

10 cleavage" entails providing heteroduplexes formed by 

hybridizing (labeled) RNA or DNA containing the wild- type 
sequence with potentially mutant RNA or DNA obtained from 
a tissue sample. The double -stranded duplexes are 
treated with an agent which cleaves single- stranded 

15 regions of the duplex such as which will exist due to 
basepair mismatches between the control and sample 
strands. RNA/DNA duplexes can be treated with RNase to 
digest mismatched regions, and DNA/DNA hybrids can be 
treated with SI nuclease to digest mismatched regions. 

20 In other embodiments, either DNA/DNA or RNA/DNA duplexes 
can be treated with hydroxylamine or osmium tetroxide and 
with piperidine in order to digest mismatched regions. 
After digestion of the mismatched regions, the resulting 
material is then separated by size on denaturing 

25 polyacrylamide gels to determine the site of mutation. 
See, e.gr., Cotton et al . (1988) Proc. Natl. Acad. Sci . 
USA 85:4397; Saleeba et al . (1992) Methods Enzymol . 
217:286-295. In a preferred embodiment, the control DNA 
or RNA can be labeled for detection. 

30 In still another embodiment, the mismatch cleavage 
reaction employs one or more proteins that recognize 
mismatched base pairs in double- stranded DNA (so called 
"DNA mismatch repair" enzymes) in defined systems for 
detecting and mapping point mutations in cDNAs obtained 

35 from samples of cells. For example, the mutY enzyme of 
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gradient gel electrophoresis (DGGE) (Myers et al . (1985) 
Nature 313:495) . When DGGE is used as the method of 
analysis, DNA will be modified to insure that it does no: 
completely denature, for example by adding a *GC clamp of 
5 approximately 4 0 bp of high-melting GC-rich DNA by PCR. 
In a further embodiment, a temperature gradient is used 
in place of a denaturing gradient to identify differences 
in the mobility of control and sample DNA (Rosenbaum and 
Reissner (1987) Biophys . Chem. 265:12753). 
0 Examples of other techniques for detecting point 
mutations include, but are not limited to, selective 
oligonucleotide hybridization, selective amplification, 
or selective primer extension. For example, 
oligonucleotide primers may be prepared in which the 
5 known mutation is placed centrally and then hybridized to 
target DNA under conditions which permit hybridization 
only if a perfect match is found (Saijci et al. (1986) 
Nature 324:163); Saiki et al . (1989) Proc. Natl. Acad. 
Sci . USA 86:6230). Such allele specific oligonucleotides 
0 are hybridized to PCR amplified target DNA or a number of 
different mutations when the oligonucleotides are 
attached to the hybridizing membrane and hybridized with 
labeled target DNA. 

Alternatively, allele specific amplification technology 
5 which depends on selective PCR amplification may be used 
in conjunction with the instant invention. 
Oligonucleotides used as primers for specific 
amplification may carry the mutation of interest in the 
center of the molecule (so that amplification depends on 
0 differential hybridization) (Gibbs et al . (1989) Nucleic 
Acids Res. 17:2437-2448) or at the extreme 3 ' . end of one 
primer where, under appropriate conditions, mismatch can 
prevent or reduce polymerase extension (Prossner (1993) 
Tibtech 11:238). In addition, it may be desirable to 
5 introduce a novel restriction site in the region of the 
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mutation to create cleavage-based detection (Gasparini et 
al. (1992) Mol. Cell Probes 6:1). it is anticipated that 
in certain embodiments amplification may also be 
performed using Taq ligase for amplification (Barany 
5 (1991) Proc, Natl. Acad. Sci . USA 88:189). In such 
cases, ligation will occur only if there is a perfect 
match a- the 3' end of the 5' sequence making it possible 
to detect the presence of a known mutation at a specific 
site by looking for the presence or absence of 

10 amplification. 

The methods described herein may be performed, for 
example ; by utilizing pre-packaged diagnostic kits 
comprising at least one probe nucleic acid or antibody 
reagent described herein, which may be conveniently used, 

15 e.g., in clinical settings to diagnose patients 

exhibiting symptoms or family history of a disease or 
illness involving a gene encoding a polypeptide of the 
invention. 

Furthermore, any cell type or tissue, preferably 
20 peripheral blood leukocytes, in which the polypeptide of 
the invention is expressed may be utilized in the 
prognostic assays described herein. 

3 . Pharmacoqenomics 

25 Agents, or modulators which have a stimulatory or 
inhibitory effect on activity or expression of a 
polypeptide of the invention as identified by a screening 
assay described herein can be administered to individuals 
to treat (prophylact ically or therapeutically) disorders 

30 associated with aberrant activity of the polypeptide. In 
conjunccion with such treatment, the pharmacogenomics 
(i.e., the study of the relationship between an 
individual's genotype and that individual's response to a 
foreign compound or drug) of the individual may be 

35 considered. Differences in metabolism of therapeutics 
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can lead to severe toxicity or therapeutic failure by 
altering the relation oetween dose and blood 
concentration of the pharmacologically active drug. Thus, 
the pharmacogenomics of the individual permits the 
5 selection of effective agents (e.g., drugs) for 
prophylactic or therapeutic treatments based on a 
consideration of the individual's genotype. Such 
pharmacogenomics can further be used to determine 
appropriate dosages and therapeutic regimens. 

10 Accordingly, the activity of a polypeptide of the 
invention, expression of a nucleic acid of the 
invention, or mutation content of a gene of the invention 
in an individual can be determined to thereby select 
appropriate agent (s) for therapeutic or prophylactic 

15 treatment of the individual. 

Pharmacogenomics deals with clinically significant 
hereditary variations in the response to drugs due to 
altered drug disposition and abnormal action in affected 
persons. See, e.g., Linder (1997) Clin. Chem. 43(2):254- 

20 266. In general, two types of pharmacogenetic conditions 
can be differentiated. Genetic conditions transmitted as 
a single factor altering the way drugs act on the body 
are referred to as "altered drug action." Genetic 
conditions transmitted as single factors altering the way 

25 the body acts on drugs are referred to as "altered drug 
metabolism" . These pharmacogenetic conditions can occur 
either as rare defects or as polymorphisms. For example, 
glucose-6 -phosphate dehydrogenase deficiency (G6PD) is a 
common inherited enzymopathy in which the main clinical 

30 complication is haemolysis after ingestion of oxidant 
drugs (anti-malarials , sulfonamides, analgesics, 
nitrofurans) and consumption of fava beans. 

As an illustrative embodiment, the activity of drug 
metabolizing enzymes is a major determinant of both the 
35 intensity and duration of drug action. The discovery of 
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genetic polymorphisms of drug metabolizing enzymes (e.g., 
N-acetyl transferase 2 (NAT 2) and cytochrome P450 enzymes 
CYP2D6 and CYP2C19) has provided an explanation as to why 
some patients do not obtain the expected drug effects or 
5 show exaggerated drug response and serious toxicity after 
taking the standard and safe dose of a drug. These 
polymorphisms are expressed in two phenotypes in the 
population, the extensive metabolizer (EM) and poor 
metabolizer (PM) . The prevalence of PM is different 

10 among different populacions. For example, the gene 
coding for CYP2D6 is highly polymorphic and several 
mutations have been identified in PM, which all lead to 
the absence of functional CYP2D6 . Poor metabolizers of 
CYP2D6 and CYP2C19 quite frequently experience 

15 exaggerated drug response and side effects when they 
receive standard doses. If a metabolite is the active 
therapeutic moiety, a PM will show no therapeutic 
response, as demonstrated for the analgesic effect of 
codeine mediated by its CYP2D6- formed metabolite 

20 morphine. The other extreme are the so called ultra- 
rapid metabolizers who do not respond to standard doses. 
Recently, the molecular basis of ultra-rapid metabolism 
has been identified to be due to CYP2D6 gene 
amplification. 

25 Thus, the activity of a polypeptide of the invention, 
expression of a nucleic acid encoding the polypeptide, .or 
mutation content of a gene encoding the polypeptide in an 
individual can be determined to thereby select 
appropriate agent (s) for therapeutic or prophylactic 

30 treatment of the individual. In addition, 

pharmacogenetic studies can be used to apply genotyping 
of polymorphic alleles encoding drug-metabolizing enzymes 
to the identification of an individual's drug 
responsiveness phenctype . This knowledge, when applied 

35 to dosing or drug selection, can avoid adverse reactions 
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or therapeutic failure and thus enhance therapeutic or 
prophylactic efficiency when treating a subject with a 
modulator of activity or expression of the polypeptide, 
such as a modulator identified by one of the exemplary 
5 screening assays described herein. 

4 . Monitoring of Effects During Clinical Trials 
Monitoring the influence of agents (e.g., drugs, 
compounds) on the expression or activity of a polypeptide 
of the invention (e.g., the ability to modulate aberrant 
10 cell proliferation and/or differentiation) can be applied 
not only in basic drug screening, but also in clinical 
trials. For example, the effectiveness of an agent, as 
determined by a screening assay as described herein, to 
increase gene expression, protein levels or protein 
15 activity, can be monitored in clinical trials of subjects 
exhibiting decreased gene expression, protein levels, or 
protein activity. Alternatively, the effectiveness of an 
agent, as determined by a screening assay, to decrease 
gene expression, protein levels or protein activity, can 
20 be monitored in clinical trials of subjects exhibiting 
increased gene expression, protein levels, or protein 
activity. In such clinical trials, expression or 
activity of a polypeptide of the invention and 
preferably, that of other polypeptide that have been 
25 implicated in for example, a cellular proliferation 
disorder, can be used as a marker of the immune 
responsiveness of a particular cell. 

For example, and not by way of limitation, genes, 
including those of the invention, that are modulated in 
30 cells by treatment with an agent (e.g., compound, drug or 
small molecule) which modulates activity or expression of 
a polypeptide of the invention (e.g., as identified in a 
screening assay described herein) can be identified. 
Thus, to study the effect of agents on cellular 
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proliferation disorders, for example, in a clinical 
trial, cells can be isolated and RNA prepared and 
analyzed for the levels of expression of a gene of the 
invention and other genes implicated in the disorder. 
5 The levels of gene expression (i.e., a gene expression 
pattern) can be quantified by Northern blot analysis or 
RT-PCR, as described herein, or alternatively by 
measuring the amount of protein produced, by one of the 
methods as described herein, or by measuring the levels 

10 of activity of a gene of the invention or other genes. 
In this way, the gene expression pattern can serve as a 
marker, indicative of the physiological response of the 
cells to the agent. Accordingly, this response state may 
be determined before, and at various points during, 

15 treatment of the individual with the agent. 

In a preferred embodiment, the present invention 
provides a method for monitoring the effectiveness of 
treatment of a subject with an agent (e.g., an agonist, 
antagonist, peptidomimetic, protein, peptide, nucleic 

20 acid, small molecule, or other drug candidate identified 
by the screening assays described herein) comprising the 
steps of (i) obtaining a pre-administrat ion sample from a 
subject prior to administration of the agent; (ii) 
detecting the level of the polypeptide or nucleic acid of 

2 5 the invention in the preadministrat ion sample; (iii) 

obtaining one or more post -administration samples from 
the subject; (iv) detecting the level the of the 
polypeptide or nucleic acid of the invention in the post- 
administration samples; (v) comparing the level of the 
30 polypeptide or nucleic acid of the invention in the pre- 
administrat ion sample with the level of the polypeptide 
or nucleic acid of the invention in the post- 
administration sample or samples; and (vi) altering the 
administration of the agent to the subject accordingly. 

3 5 For example, increased administration of the agent may be 
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desirable to increase the expression or activity of the 
polypeptide to higher levels than detected, i.e., to 
increase the effectiveness of the agent. Alternatively, 
decreased administration of the agent may be desirable to 
5 decrease expression or activity of the polypeptide to 
lower levels than detected, i.e., to decrease the 
effectiveness of the agent. 

C . Methods of Treatment 

The present invention provides for both prophylactic 
0 and therapeutic methods of treating a subject at risk of 
(or susceptible to) a disorder or having a disorder 
associated with aberrant expression or activity of a 
polypeptide of the invention. 

1 . Prophylactic Methods 

5 In one aspect, the invention provides a method for 
preventing in a subject, a disease or condition 
associated with an aberrant expression or activity of a 
polypeptide of the invention, by administering to the 
subject an agent which modulates expression or at least 

0 one activity of the polypeptide. Subjects at risk for a 
disease which is caused or contributed to by aberrant 
expression or activity of a polypeptide of the invention 
can be identified by, for example, any or a combination 
of diagnostic or prognostic assays as described herein. 

5 Administration of a prophylactic agent can occur prior to 
the manifestation of symptoms characteristic of the 
aberrancy, such that a disease or disorder is prevented 
or, alternatively, delayed in its progression. Depending 
on the type of aberrancy, for example, an agonist or 

0 antagonist agent can be used for treating the subject. 
The appropriate agent can be determined based on 
screening assays described herein. 
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2 . Therapeutic Methods 

Another aspecc of the invention pertains to methods of 
modulating expression or activity of a polypeptide of the 
invention for therapeutic purposes. The modulatory 
5 method of the invention involves contacting a cell with 
an agent that modulates one or more of the activities of 
the polypeptide. An agent that modulates activity can be 
an agent as described herein, such as a nucleic acid or a 
protein, a naturally-occurring cognate ligand of the 

10 polypeptide, a peptide, a pept idomimet ic , or other small 
molecule. In one embodiment, the agent stimulates one or 
more of the biological activities of the polypeptide. 
Examples of such stimulatory agents include the active 
polypeptide of the invention and a nucleic acid molecule 

15 encoding the polypeptide of the invention that has been 
introduced into the cell. In another embodiment, the 
agent inhibits one or more of the biological activities 
of the polypeptide of the invention. Examples of such 
inhibitory agents include antisense nucleic acid 

20 molecules and antibodies. These modulatory methods can 
be performed in vitro (e.g., by culturing the cell with 
the agent) or, alternatively, in vivo (e.g, by 
administering the agent to a subject) . As such, the 
present invention provides methods of treating an 

25 individual afflicted with a disease or disorder 

characterized by aberrant expression or activity a 
polypeptide of the invention. In one embodiment, the 
method involves administering an agent (e.g., an agent 
identified by a screening assay described herein) , or 

30 combination of agents that modulates (e.g., upregulates 
or downregulates) expression or activity. In another 
embodiment, the method involves administering a 
polypeptide of the invention or a nucleic acid molecule 
of the invention as therapy to compensate for reduced or 

35 aberrant expression cr activity of the polypeptide. 



WO 00/18800 



PCT/US99/22818 



- 99 - 

Stimulation of activity is desirable in situations in 
which activity or expression is abnormally low 

o 

downregulated and/or in which increased activity is 
likely to have a beneficial effect. Conversely, 
inhibition of activity is desirable in situations in 
which activity or expression is abnormally high or 
upregulated and/or in which decreased activity is likely 
to have a beneficial effect. 

This invention is further illustrated by the following 
examples which should not be construed as limiting. The 
contents of all references, patents and published patent 
applications cited throughout this application are hereby 
incorporated by reference. 

Equivalents 

Those skilled in the art will recognize, or be able to 
ascertain using no more than routine experimentation, 
many equivalents to the specific embodiments of the 
invention described herein. Such equivalents are 
intended to be encompassed by the following claims. 



What is claimed is: 
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1. An isolaced nucleic acid molecule selected from 
the group consisting of: 

a) a nucleic acid molecule comprising a nucleotide 
sequence which is at least 55% identical to the 
5 nucleotide sequence of SEQ ID NO:l, 3, 4, or 6, the cDNA 
insert of the plasmid deposited with ATCC as Accession 

Number , the cDNA insert of the plasmid deposited 

with ATCC as Accession Number , or a complement 

thereof ; 

10 b) a nucleic acid molecule comprising a fragment of 
at least 300 nucleotides of the nucleotide sequence of 
SEQ ID NO:!, 3, 4, or 6 , the cDNA insert of the plasmid 

deposited with ATCC as Accession Number , the 

cDNA insert of the plasmid deposited with ATCC as 

15 Accession Number , or a complement thereof; 

c) a nucleic acid molecule which encodes a 
polypeptide comprising the amino acid sequence of SEQ ID 
NO: 2 or 5, amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with ATCC as Accession Number 

20 , or the amino acid sequence of the cDNA insert of 

the plasmid deposited with ATCC as Accession Number 



d) a nucleic acid molecule which encodes a fragment 
of a polypeptide comprising the amino acid sequence of 

2 5 SEQ ID NO: 2 or 5, the polypeptide encoded by the cDNA 

insert of the plasmid deposited with ATCC as Accession 

Number , or the polypeptide encoded by the 

cDNA insert of the plasmid deposited with ATCC as 
Accession Number , wherein the fragment comprises 

3 0 at least 15 contiguous amino acids of SEQ ID NO: 2 or 5, 

the polypeptide encoded by the cDNA insert of the plasmid 

deposited with ATCC as Accession Number , or 

the polypeptide encoded by the cDNA insert of the plasmid 
deposited with ATCC as Accession Number ; and 
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e) a nucleic acid molecule which encodes a naturally 
occurring allelic variant of a polypeptide comprising the 
amino acid sequence of SEQ ID NO: 2 or 5, the amino acid 
sequence encoded by the cDNA insert of the plasmid 

5 deposited with ATCC as Accession Number , or the 

amino acid sequence encoded by the cDNA insert of the 

plasmid deposited with ATCC as Accession Number , 

wherein the nucleic acid molecule hybridizes to a nucleic 
acid molecule comprising SEQ ID NO: 3 or 6 or a complement 
10 thereof under stringent conditions. 

2. The isolated nucleic acid molecule of claim 1, 
which is selected from the group consisting of: 

a) a nucleic acid comprising the nucleotide sequence 
of SEQ ID NO:l, 3, 4, or 5, the cDNA insert of the 

15 plasmid deposited with ATCC as Accession Number , 

the cDNA insert of the plasmid deposited with ATCC as 
Accession Number , or a complement thereof; and 

b) a nucleic acid molecule which encodes a 
polypeptide comprising the amino acid sequence of SEQ ID 

2 0 NO: 2 or 5, the amino acid sequence encoded by the cDNA 
insert of the plasmid deposited with ATCC as Accession 

Number , or the amino acid sequence encoded 

by the cDNA insert of the plasmid deposited with ATCC as 
Accession Number . 



25 3. The nucleic acid molecule of claim 1 further 
comprising vector nucleic acid sequences. 

4. The nucleic acid molecule of claim 1 further 
comprising nucleic acid sequences encoding a heterologous 
polypeptide . 

30 5. A host cell which contains the nucleic acid 
molecule of claim 1 . 
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6. The host cell of claim 5 which is a mammalian host 
cell. 

7. A non-human mammalian host cell containing the 
nucleic acid molecule of claim 1. 

5 8. An isolated polypeptide selected from the group 
consisting of: 

a) a fragment of a polypeptide comprising the amino 
acid sequence of SEQ ID NO: 2 or 5, wherein the fragment 
comprises at least 15 contiguous amino acids of SEQ ID 

10 NO: 2 or 5; 

b) a naturally occurring allelic variant of a 
polypeptide comprising the amino acid sequence of SEQ ID 
NO: 2 or 5, the amino acid sequence encoded by the cDNA 
insert of the plasmid deposited with ATCC as Accession 

15 Number , or the amino acid sequence encoded by 

the cDNA insert of the plasmid deposited with ATCC as 

Accession Number , wherein the polypeptide is 

encoded by a nucleic acid molecule which hybridizes to a 
nucleic acid molecule comprising SEQ ID NO: 1 or 4 or a 

2 0 complement thereof under stringent conditions; and 

c) a polypeptide which is encoded by a nucleic acid 
molecule comprising a nucleotide sequence which is at 
least 55% identical to a nucleic acid comprising the 
nucleotide sequence of SEQ ID NO:l or 4 or a complement 

2 5 thereof. 

9. The isolated polypeptide of claim 8 comprising the 
amino acid sequence of SEQ ID NO: 2 or 5, the amino acid 
sequence encoded by the cDNA insert of the plasmid 

deposited with ATCC as Accession Number , or 

30 the amino acid sequence encoded by the cDNA insert of the 
plasmid deposited with ATCC as Accession Number . 
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10. The polypeptide of claim 8 further comprising 
heterologous amino acid sequences. 

11 . An antibody which selectively binds to a 
polypeptide of claim 8. 

12. A method for producing a polypeptide selected from 
the group consisting of: 

a) a polypeptide comprising the amino acid sequence 
of SEQ ID NO: 2 or 5, the amino acid sequence encoded by 
the cDNA insert of the plasmid deposited with ATCC as 

Accession Number , or the amino acid sequence 

encoded by the cDNA insert of the plasmid deposited with 
ATCC as Accession Number 

b) a polypeptide comprising a fragment of the amino 
acid sequence of SEQ ID NO: 2 or 5, the amino acid 

5 sequence encoded by the cDNA insert of the plasmid 

deposited with ATCC as Accession Number , or 

the amino acid sequence encoded by the cDNA insert of the 

o 

plasmid deposited with ATCC as Accession Number , 

wherein the fragment comprises ?.t least 15 contiguous 

0 amino acids of SEQ ID NO: 2 or 5, the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with 

ATCC as Accession Number or the amino acid 

sequence encoded by the cDNA insert of the plasmid 
deposited with ATCC as Accession Number ; and 

5 c) a naturally occurring allelic variant of a 

polypeptide comprising the amino acid sequence of SEQ ID 
NO: 2 or 5, the amino acid sequence encoded by the cDNA 
insert of the plasmid deposited with ATCC as Accession 
Number or the amino acid sequence encoded by 

0 the cDNA insert of the plasmid deposited with ATCC as 

Accession Number , wherein the polypeptide is 

encoded by a nucleic acid molecule which hybridizes to a 
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nucleic acid molecule comprising SEQ ID NO: 1 or 4 or a 
complement thereof under stringent conditions; 

comprising culturing the host cell of claim 5 under 
conditions in which the nucleic acid molecule is 
5 expressed. 

13. A method for detecting the presence of a 
polypeptide of claim 8 in a sample, comprising: 

a) contacting the sample with a compound which 
selectively binds to a polypeptide of claim 8; and 

10 b) determining whether the compound binds to the 
polypeptide in the sample. 

14. The method of claim 13, wherein the compound which 
binds to the polypeptide is an antibody. 

15. A kit comprising a compound which selectively 
15 binds to a polypeptide of claim 8 and instructions for 

use . 

16. A method for detecting the presence of a nucleic 
acid molecule of claim 1 in a sample, comprising the 
steps of : 

20 a) contacting the sample with a nucleic acid probe or 
primer which selectively hybridizes to the nucleic acid 
molecule; and 

b) determining whether the nucleic acid probe or 
primer binds to a nucleic acid molecule in the sample. 

25 17. The method of claim 16, wherein the sample 

comprises mRNA molecules and is contacted with a nucleic 
acid probe. 
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18. A kit comprising a compound which selectively 
hybridizes to a nucleic acid molecule of claim 1 and 
instructions for use. 

19. A method for identifying a compound which binds to 
5 a polypeptide of claim 8 comprising the steps of: 

a) contacting a polypeptide, or a cell expressing a 
polypeptide of claim 8 with a test compound; and 

b) determining whether the polypeptide binds to the 
test compound. 

10 20. The method of claim 19, wherein the binding of the 

test compound to the polypeptide is detected by a method 

selected from the group consisting of: 

a) detection of binding by direct detecting of test 

compound/polypeptide binding; 
15 b) detection of binding using a competition binding 

assay; 

c) detection of binding using an assay for TANGO 191 
or TANGO 195 -mediated signal transduction. 

21. A method for modulating the activity of a 
20 polypeptide of claim 8 comprising contacting a 

polypeptide or a cell expressing a polypeptide of claim 8 
with a compound which binds to the polypeptide in a 
sufficient concentration to modulate the activity of the 
polypeptide . 

25 22. A method for identifying a ^compound which 
modulates the activity of a polypeptide of claim 8, 
comprising : 

a) contacting a polypeptide of claim 8 with a test 
compound ; and 
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b) determining the effect of the test compound on the 
activity of the polypeptide to thereby identify a 
compound which modulates the activity of the polypeptide. 
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GTCGACCC^CGCGTCCGCAGAGAAGAGTTTGAGATGCTTCTTCTTCAGAGCACTTCC-.^C 
— AAAGAGGTATCTC'CTGGATAGGAAGAAATATAGTAGAACCCTTTGAAAATGGATATT 
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16 21 GAGAGGA G7GG7GCTCC7G7ACA7CC7GC7TGGCACCA7CGGGA^ 

• 68 , GGCGGCGAGTGCCCTCCTCTACAGGCACTGGATTGAAATAGTGCTGCTGTACCGGACCTA 

j / 6 J i ■ " 

•-41 cCAGAGCAAGGA7CAGACGC77GGGGA7AAAAAGGA7777GA7GC777CG^A7CC7A7GC 
' 3 96 Q S K 0 Q T L G D K X D r D A F V S V 

■ 3 0 1 AAAATGGAGCTCTTTTCCAAGTGAGGCCACTTCATCTCTGAGTGAAGAACACTTGGCCCT 

" 41 - - :< w s S p ? S E A T S S 1 S E - * » « - 

1861 GAGCC7A777CC7^TG7??7AGAAAACAAATA7 

< 92 i ^GATGTGGCTCCAGGAGGAGTGTATGCAGAAGACATTGTGAGCATTACTAAGAGAAGCAG 
456 DVAPGGVYAEDIVSIIKRSR 

l98l aagaggaata^axcxtgac,™^ 

476 RGIF ILb * w 

- 10 - TtACTTCCAAGAGCCAGAGTCTCTACCTCATCTCGTGAAAAAAGCTCTCAGGGTTJTGCC 

CACAGTTACTrGGAGAGK^AAAATCAGTTCCTCCCAATTCTAGGTTCTGGGCCAAAAT 

536 xVTWRGLKSvf 

G«™CCACA«CC^^ 

2:81 CTCTAGGATTTTTCAGTGGAAAGGA^CAGTAGA^AGAAACCACTOTGAGGAGCTCCCA 
576 SRIFQ WKGLSR 

: 3 4 1 gcct.aaggaatggtgaaatgagccctggagccccctccagtccagtccctgggatagaga 

596 ? K E W • 

~~*r.m± sr-T ar AGCTC-GTG7GTGTGTGTTCAGGC7GATAGGAAATTCAAA 

^agccSga^ccS^^ 

252 . - TAGAG ^^ !T^^:^Ic7AAGAT7-CCCAG7GG7CCGAGCAGAATCAGAAAA7ACAG 
I i.7G7A7ACT7GCAATCAAA7AAA7AT7A7777A7TAGAAAAAAAAAAAA^ 
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1 GCG7CCGGA7G7T77CAC7777GGGACA7CC7G77C7GAG7CAAGA77CC7CC7TC7GAA 

51 CA7GGGAC777CCAGAAGGACCACAGC7CC7CCCG7GCA7CCAC7CGGCC7GGGAGG77C 

121 TGGATTTTGGCTGTCGAGGGAGTTTGCCTGCCTCTCCAGAGAAAGATGGTCATGAGGCCC 
1 M V M R F 

1 3 1 CTGTGGAGTCTGCTTCTCTGGGAAGCCCTACTTCCCATTACAG7TACTGG7GCCCAAGTG 
6LWSLL-WEALLPITVTGAQV 

241 C7GAGCAAAG7CGGGGGC7CGG7GC7GC7GG7GGCAGCGCG7CCCCC7GGC77CCAAG7C 
26LSKVGGSVLLVAARPPGFQV 

3 0 1 CG7GAGGC7A7C7GGCGA7C7CTC7GGCC7TCAGAAGAGC7CC7GGCCACG77777CCGA 
46 REAIWRSLWPS EELLAT FFR 

3 6 1 GGC7CCC7GGAGAC7C7G7ACCA77CCCGC77CC7GGGCCGAGCCCAGC7ACACAGCAAC 
66GSLE7LYHSRFLGRAQLHSN 

421 C7CAGCC7GGAGC7CGGGCCGC7GGAG7C7GGAGACAGCGGCAAC7TC7CCG7G77GA7G 
86LS1ELGPLESGDSGNFSVLM 

481 G7GGACACAAGGGGCCAGCCC7GGACCCAGACCC7CCAGC7CAAGG7G7ACGA7GCAG7G 
106VDTRGQPWTQT LQLKVYDAV 

541 CCCAGGCCCG7GG7ACAAG7G77CA77GC7G7AGAAAGGGA7GC7CAGCCC7CCAAGACC 
126 ?R*PVVQVF IAVERDAQ P SK7 

601 7GCCAGG7777C77G7CC7G77GGGC CCCCAAC A7CAGCGAAA7AACC7A7AGC7GGCGA 
146 CQVFL SCWAPNI SEI7YSWR 

661 CGGGAGACAACCATGGACTTTGGTATGGAACCACACAGCCTCTTCACAGACGGACAGGTG 
166RE7TM0FGMEPHSLF7DGQV 

721 C7GAGC A7T7CCC7GGGACCAGGAGACAGAGA7G7GGCC7ATTCC7GCA77G7C7CCAAC 
186 1SISLGPGDRDVAYSCIVSN 

7 8 1 CC7G7CAGC7GGGAC77GGCCACAG7CACGCCC7GGGA7AGC7G7CA7CATGAGGCAGCA 
206 PVSWDLATV7 PWDSCHHEAA 

841 CCAGGGAAGGCC7CC7ACAAAGA7G7GC7GC7GG7GG7GG7GCC7G7C7CGC7GC7CC7G 
226 PGKAS YKOVLLVVVPVS L.LL 

901 A7GC7GG77AC7C7C77C7C7GCC7GGCAC7GG7GCCCC7GC7CAGGGCCCCACC7CAGA 
246 MLVTLFSAWHWCPCSGPKLR 

9 6 1 7CAAAGCAGCTCTGGA7GAGATGGGACC7GCAGC7C7CCC7CCCCAAGG7GAC7C77AGC 
266 SKQLWMRWDLQLSLPKV7LS 

0 2 1 .AACC7C A777CGACAG7GG7TTGTAGCG7GGTGC ACCAGGGCC7TG77GAACAGATCCAC 
286 M LIST VVCSVVHQGLVEQIH 

081 ACG7GC7C7 AA7AAAG7TCCCA 

3 06 7 C 5 M K 7 P 
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290 300 310 320 330 
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Score: 7.10 Seq: 71 128 Model: 6 47 

•r.TnroVsfhPcdYtlwWY. . . rNoooi iltlnsWq 

7AJ0OO \H\ 71 LPFMGSNDLSDVQ--WYOQFSNGDPI^iaxSYPKIICDKCTLKFLTPG 

yEDsGt YwQnV * 

118 VNNSGSYICRP 128 



Score: 12.19 Seq: 168 223 Model; 1 47 

* Gqs VTLTQnVs . . fhPodVt . IwWYrNoooi wLtlnsWavEDs 

TAUGO ^ l 168 GSTCSISCPSLSCQSDA0SPAVTW ^ 215 

GtYwCmV 

• 217 CTYVCDY 223 



Score- 4.61 Seq: 266 339 Model: 1 47 

' -GqsVTLTCmVs . . <hPt>dYt . IwWYrNaacx . 



TAU*>WI ~ 66 GKPLTISCKARFGFERVFNPVIKWYIKDSCLE".VEVSVPEAKSIKSTLKD 314 

cLClnsWcvEOs.GtYwCaV 

315 EIIERNIILEKVTORDUUUCFVCrv 3 39 



tic*. C 
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Input file T195Atmue9fll; Output File T195Atrmie9f 1 1. pat 
Sequence length 1603 



MWSLWSLLL 3 

63 



CCCACGCGTCCGCCCACGCGTCCGCAAGGGAGGACAACGGCC ATG TGG TCC CTC TGG AGT C7T CTT CTC 

pvvV VSVQVLSKVGD 29 
TTT GAA GCT CTC CTT CCC GTT GTG GTT GTC AGT GTC CAA GTG CTA AGC AAG GTA GGG GAC 12 9 

rLV AECPPGFQVREAIWR 49 
TCA GAG CTG CTG GTG GCC GAG TGT CCT CCG GGC TTC CAA GTG CGT GAG GCT ATC TGG CGA 189 
cEELLATFFRGSLETL 69 
TCT CTG TGG CCA TCG GAG GAG CTC CTG GCC ACA TTT TTC CGA GGT TCC TTG GAG ACT CTG 24 9 
c bflGRVQLYDNLSLELG 89 
TAC CAC TCT CGT TTC CTG GGC CGA GTC CAG CTA TAT GAC AAC CTC AGC CTG GAG CTT GGA 309 

GNFS V LMVDTRGQ109 
CCC CTG aL CCT GGA GAC AGC GGC AAT TTC TCT GTG CTG ATG GTG GAT ACA AGG GGT CAA 369 

yLKV yDAVPKPEVQ129 
ACC TGG ACC CAG ACC CTG TAT CTC AAG GTG TAC GAT GCA GTA CCC AAG CCC GAG GTT CAA 429 

AEETQ PLNTCQVFLS 149 
GTG TTC ACT GCT GCA GCA GAG GAG ACC CAA CCC CTC AAT ACC TGT CAG GTC TTC TTG TCC 489 
MTSDITYSWRREGTVD1S9 
TGC TGG GCC CCC AAC ATC AGT GAC ATA ACC TAC AGC TGG CGA CGG GAG GGG ACA GTG GAC 549 
VHSHFSNGQVLSVSLG189 
TTC AAT GGT oIa GTG CAC AGC CAT TTC TCA AAT GGA CAG GTG TTA AGT GTC TCA CTG GGA 609 
„ V&FTCIASNPVSWDM209 
CTG GGG GAC AAG GAT GTG GCC TTT ACC TGC ATT GCC TCC AAT CCT GTC AGC TGG GAT ATG 669 

ACC ACA GTC ACC CCC TGG GAG AGC TGC CAT CAC GAG GCA GCC TCC GGG AAG GCC TCC TAC 72 9 

„ „ , L V V V P I T L F L I L A G L F 249 
AAG GAC GTG CTA CTG GTA GTA GTG CCA ATT ACA CTG TTC CTG ATC CTG GCT GGT CTC TTT 789 
„ orLC SGKKKDACTDGV269 
CCG GCA TGG CAC CAT GGC CTC TGC TCA GGG AAG AAG AAG GAT GCT TGC ACT GAC GGG GTG 849 

279 

LPETENALV s?3 
CTT CCA GAG ACA GAG AAT GCC CTC GTA TAG 



AGGATGTCATGAGGGACAACATAAACTGGTGCTTGGACCATGATGAGATGCCCTGCTCACCCACGTGATGCTCCACACC 
TGGACACCTCCAGGATCCTCATAACTGCCACAAGTCGGCCTGCCTCTAGCGGACAGCCAAGAAAACCACCATCCTGGAA 
CGTCACCCTGGCCCAAACTTCCTCCTTCCTCCATCCTGTTCGCACATGCCGGATCCTCTCTGGGCAAGGTGAACTAGTA 

m. 7 ( ' c '-~-<- 



958 

1037 

1116 
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GG ATGCTTCCTTCAGAACACAGGACTTTCTCTAGGATCCACAG AG ACATTGATTATCCAAGGCATCCATTCTTCTATC A 1195 

CTGTACATAAGGTCTTGCCCAACAGCCACCAAGGGACGGCCTCCAGGCCAGGACCTTGGCTCAAAGAGAGATGAGATGT 12 74 

TTGAACTAACATGGAAATTGAGCTAACCATTGCCAACTCCAACTCCAGCCCCTGGGGTCTGAGTTCCTTGTGTTCAAGA 13 5 3 

TGTTATTATAAGAAAAGGCAAAGAACAGGAAATGATGGAGGGTGGGGCATTCTCTTTCTGGTCTGAAGGACTTTAAGAT 14 3 2 

TATCTGAGTTCAAGGGCCATCAAAAGTAAATTGAGATTACAGATGATGAGGGGTTGGTAGCTAATGTGCCATGTTGGGA 1511 

TCAAAGCCATTTTTCTGGTAGCACTATATTAATAGACACCTTT 15 9 0 

AAAAAAAAAAAAA 1603 




Z C? LI 
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input file T195Athpb93fl; Output File T19SAthpb93 f 1 . pat 

Sequence length 1118 



GATGTTTTCACTTTTGGGACATCCTGTTCTGAGTCAAGATTCCTCCTTCTGAACATGGGACTTTCCAGAAGGACCACAG 




CTCCTCCCGTGCATCCACTCGGCCTGGGAGGTTCTGGATTTTGGCTGTCGAGGGAG 


TTTGCCTGCCTCTCCAGAGAAAG 


158 


M 
ATG 


V 
GTC 


M 
ATG 


R 
AGG 


P 
CCC 


L 
CTG 


W 
TGG 


S 
AGT 


L 
CTG 


L 
CTT 


L 
CTC 


W 
TGG 


E 

LjAft 


A 


L 
CTA 


L 
CTT 


F 
CCC 


T 

ATT 


T 
ACA 


V 
GTT 


20 
218 


T 
ACT 


G 
GGT 


A 

GCC 


Q 
CAA 


V 
GTG 


L 

CTG 


S 
AGC 


K 
AAA 


V 
GTC 


G 
GGG 


G 
GGC 


S 
TCG 


V 
i\j 


L 

Liu 


L 
CTG 


V 
GTG 


A 
GCA 


A 
GCG 


R 
CGT 


P 
CCC 


40 
278 


P 
CCT 


G 
GGC 


F 
TTC 


Q 
CAA 


V 
GTC 


R 
CGT 


E 
GAG 


A 

GCT 


I 

ATC 


W 
TGG 


R 
CGA 


S 
TCT 


L 


w 

Ten 


P 


S 
TCA 


E 
GAA 


E 
GAG 


L 
CTC 


L 
CTG 


60 
338 


A 
GCC 


T 
ACG 


F 
TTT 


F 
TTC 


R 
CGA 


G 
GGC 


S 
TCC 


L 
CTG 


E 
GAG 


T 
ACT 


L 
CTG 


Y 
TAC 


H 

UA1 


S 


R 
CGC 


F 
TTC 


L 
CTG 


G 
GGC 


R 
CGA 


A 
GCC 


80 
398 


Q 
CAG 


L 
CTA 


H 
CAC 


S 
AGC 


N 
AAC 


L 
CTC 


S 
AGC 


L 
CTG 


E 
GAG 


L 
CTC 


G 
GGG 


P 
CCG 


L 


E 

nun 


S 
TCT 


G 
GGA 


D 
GAC 


S 
AGC 


G 
GGC 


N 
AAC 


100 
458 


F 
TTC 


S 
TCC 


V 
GTG 


L 
TTG 


M 
ATG 


V 
GTG 


D 
GAC 


T 
ACA 


R 
AGG 


G 
GGC 


Q 
CAG 


P 


w 

i uy 


T 

ACC 


Q 
CAG 


T 
ACC 


L 
CTC 


Q 
CAG 


L 
CTC 


K 
AAG 


120 
518 


V 
GTG 


Y 
TAC 


D 
GAT 


A 
GCA 


V 
GTG 


P 

CCC 


R 
AGG 


P 
CCC 


V 
GTG 


V 
GTA 


Q 

CAA 


V 


F 
TTC 


I 

ATT 


A 
GCT 


V 
GTA 


E 
GAA 


R 
AGG 


D 
GAT 


A 
GCT 


140 
578 


0 

CAG 


P 

CCC 


S 
TCC 


K 
AAG 


T 
ACC 


c 

TGC 


Q 
CAG 


V 
GTT 


F 
TTC 


L 
TTG 


S 


c 

lul 


W 
TGG 


A 
GCC 


P 
CCC 


N 
AAC 


I 

ATC 


S 
AGC 


E 
GAA 


I 

ATA 


160 
638 


*T 
ACC 


Y 

TAT 


S 
AGC 


W 
TGG 


R 

CGA 


R 

CGG 


E 
GAG 


T 
ACA 


T 
ACC 


M 

ATG 


D 


F 
TTT 


G 
GGT 


M 
ATG 


£ 
GAA 


P 
CCA 


H 
CAC 


S 
AGC 


L 
CTC 


F 
TTC 


180 
698 


T 
ACA 


D 
GAC 


G 
GGA 


Q 
CAG 


V 
GTG 


L 
CTG 


S 
AGC 


I 

ATT 


S 
TCC 


L 
CTG 


G 
GGA 


P 
CCA 


G 
GGA 


D 
GAC 


R 
AGA 


D 
GAT 


V 
GTG 


A 

GCC 


Y 
TAT 


S 
TCC 


200 
758 


C 
TGC 


I 

ATT 


V 
GTC 


S 

TCC 


N 
AAC 


P 
CCT 


V 
GTC 


S 
AGC 


w 

TGG 


D 
GAC 


T 

Li 

TTG 


a 

GCC 


T 
ACA 


V 
GTC 


T 
ACG 


P 
CCC 


W 
TGG 


D 
GAT 


S 
AGC 


C 
TGT 


220 
818 


H 
CAT 


H 
CAT 


E 
GAG 


A 
GCA 


A 
GCA 


P 
CCA 


G 
GGG 


K 
AAG 


A 
GCC 


S 
TCC 


Y 
TAC 


K 
AAA 


D 
GAT 


V 
GTG 


L 
CTG 


L 
CTG 


V 
GTG 


V 
GTG 


V 
GTG 


P 
CCT 


240 
878 


V 
GTC 


S 
TCG 


L 
CTG 


L 
CTC 


L 
CTG 


M 
ATG 


L 
CTG 


V 
GTT 


T 
ACT 


L 
CTC 


F 
TTC 


S 
TCT 


A 
GCC 


W 
TGG 


H 
CAC 


W 
TGG 


C 
TGC 


P 
CCC 


C 
TGC 


S 
TCA 


260 
938 


G 
GGG 


P 
CCC 


H 
CAC 


L 
CTC 


R 
AGA 


S 
TCA 


K 
AAG 


Q 
CAG 


L 
CTC 


W 
TGG 


M 
ATG 


R 
AGA 


W 
TGG 


D 
GAC 


L 
CTG 


Q 
CAG 


L 
CTC 


s 

TCC 


L 
CTC 


H • 
CAC 


280 
998 


K 
AAG 


V 
GTG 


T 
ACT 


L 
CTT 


S 
AGC 


N 
AAC 


L 
CTC 


I 

ATT 


S 
TCG 


T 
ACA 


V 
GTG 


V 
GTT 


C 
TGT 


S 
AGC 


V 
GTG 


V 
GTG 


H 
CAC 


Q 
CAG 


G 
GGC 


L 
CTT 


300 
1058 


V 


E 


0 


I 


H 


T 


A 


L 


I 


K 


F 


P 


S 


L 


M 


K 


K 


K 


K 


K 


320 



GTT GAA CAG ATC CAC ACT GCT CTA ATA AAG TTC CCA TCC TTA ATG AAA AAA AAA AAA AAA 1118 
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Input file T195AthLal70f 10; Output File T19SAthLal70f 10 . pat 
Sequence length 2894 

MVMRPLWSLLLWE 13 
CCCACGCGTCCGCTCCAGAGAAAG ATG GTC ATG AGG CCC CTG TGG AGT CTG CTT CTC TGG GAA 63 

ALLP ITVTGAQVLSKVGGSV 33 
GCC CTA CTT CCC ATT ACA GTT ACT GGT GCC CAA GTG CTG AGC AAA GTC GGG GGC TCG GTG 12 3 

LL VAARPPGFQVREAIWRSL 53 
CTG CTG GTG GCA GCG CGT CCC CCT GGC TTC CAA GTC CGT GAG GCT ATC TGG CGA TCT CTC 18 3 

wpS £ELLATFFRGSLETLYH 73 
TGG CCT TCA GAA GAG CTC CTG GCC ACG TTT TTC CGA GGC TCC CTG GAG ACT CTG TAC CAT 24 3 

SRFLG RAQLHSNLSLELGPL 93 
TCC CCC TTC CTG GGC CGA GCC CAG CTA CAC AGC AAC CTC AGC CTG GAG CTC GGG CCG CTG 3 03 

ESGDS SNFSVLMVDTRGQPW113 
GAG TCT GGA GAC AGC AGC AAC TTC TCC GTG TTG ATG GTG GAC ACA AGG GGC CAG CCC TGG 3 63 

T qTLQLKVYDA-VPR. PVVQVF133 
ACC CAG ACC CTC CAG CTC AAG GTG TAC GAT GCA GTG CCC AGG CCC GTG GTA CAA GTG TTC 423 

IAV ERDAQPSKTCQVFLSCW153 
ATT GCT GTA GAA AGG GAT GCT CAG CCC TCC AAG ACC TGC CAG GTT TTC TTG TCC TGT TGG 4 83 

APNIS EITYSWRRETTMDFG173 
GCC CCC AAC ATC AGC GAA ATA ACC TAT AGC TGG CGA CGG GAG ACA ACC ATG GAC TTT GGT 54 3 

MEP HSLFTDGQVLSISLGPG193 
ATG GAA CCA CAC AGC CTC TTC ACA GAC GGA CAG GTG CTG AGC ATT TCC CTG GGA CCA GGA 60 3 

DRD VAYSCIVSNPVSWDLAT213 
GAC AGA GAT GTG GCC TAT TCC TGC ATT GTC TCC AAC CCT GTC AGC TGG GAC TTG GCC ACA 66 3 

vT pWDSCHHEAAPGKASYKD233 
GTC ACG CCC TGG GAT AGC TGT CAT CAT GAG GCA GCA CCA GGG AAG GCC TCC TAC AAA GAT 72 3 

VLLVV VPV SLLLMLVTLFSA253 
GTG CTG CTG GTG GTG GTG CCT GTC TCG CTG CTC CTG ATG CTG GTT ACT CTC TTC TCT GCC 7 83 

WHW CPCSGKKKKDVHADRVG273 
TGG CAC TGG TGC CCC TGC TCA GGG AAA AAG AAA AAG GAT GTC CAT GCT GAC AGA GTG GGT 84 3 

PETENPLVODLP* 286 
CCA GAG ACA GAG AAC CCC CTT GTG CAG GAT CTG CCA TAA 882 

AGGACAATATGAACTGATGCCTGGACTATCAGTAACCCCACTGCACAGGCACACGATGCTCTGGGACATAACTGGTGCC 961 
TGGAAATCACCATGGTCCTCATATCTCCCATGGGAATCCTGTCCTGCCTCGAAGGAGCAGCCTGGGCAGCCATCACACC 104 0 
ACGAGGACAGGAAGCACCAGCACGTTTCACACCTCCCCCTTCCCTCTCCCATCTTCTCATATCCTGGCTCTTCTCTGGG 1119 



cj (_ i ofL) 
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CAAGATGAGCCAAGCAGAACATTCCATCCAGGACACTGGAAGTTCTCCAGGATCCAGATCCATGGGGACATTAATAGTC 1198 

CAAGGCATTCCCTCCCCCACCACTATTCATAAAGTATTAACCAACTGGCACCAAGGAATTGCCTCCAGCCTGAGTCCTA 1277 

GGCTCTAAAAGATATTACATATTTGAACTAATAGAGGAACTCTGAGTCACCCATGCCAGCATCAGCTTCAGCCCCAGAC 13 5 6 

CCTGCAGTTTGAGATCTGATGCTTCCTGAGGGCCAAGGCATTGCTGTAAGAAAAGGTCTAGAAATAGGTGAAAGTGAGA 14 3 5 

GGTGGGGGACAGGGG TTTCTCTTTCTGGCCTAAGGACTTTCAGGTAATCAGAGTTCATGGGCCCTCAAAGGTAAATTGC 15m 

AGTTGTAGACACCGAGGATGGTTGACAACCCATGGTTGAGATGGGCACCGTTTTGCAGGAAACACCATATTAATAGACA 1593 

TCCTCACCATCTCCATCCGCTCTCACGCCTCCTGCAGGATCTGGGAGTGAGGGTGGAGAGTCTTTCCTCACGCTCCAGC 1672 

ACAGTGGCCAGGAAAAGAAATACTGAATTTGCCCCAGCCAACAGGACGTTCTTGCACAACTTCAAGAAAAGCAGCTCAG 1751 

CTCAGGATGAGTCTTCCTGCCTGAAACTGAGAGAGTGAAGAACCATAAAACGCTATGCAGAAGGAACATTATGGAGAGA 18 3 0 

AAGGGTACTGAGGCACTCTAGAATCTGCCACATTCATTTTCAAATGCAAATGCAGAAGACTTACCTTAGTTCAAGGGGA 1909 

GGGGACAAAGACCCCACAGCCCAACAGCAGGACTGTAGAGGTCACTCTGACTCCATCAAACTTTTTATTGTGGCCATCT 1988 

TAGGAAAATACATTCTGCCCCTGAATGATTCTGTCTAGAAAAGCTCTGGAGTATTGATCACTACTGGAAAAACACTTAA 2067 

GGAGCTAAACTTACCTTCGGGGATTATTAGCTGATAAGGTTCACAGTTTCTCTCACCCAGGTGTAACTGGATTTTT^ 2146 

GGGGCCTCAATCCAGTCTTGATAACAGCGAGGAAAGAGGTATTGAAGAAACAGGGGTGGGTTTGAAGTACTATTTTCCC 2225 

AGGGTGGCTTCAATCTCCCCACCTAGGATGTCAGCCCTGTCCAAGGACCTTCCCTCTTCTCCCCAGTTCCTGGGCAATC 2304 

ACTTCACCTrGGACAAAGGATCAGCACAGCTGGCCTCCAGATCCACATCACCACTCTTCCACTCGATTGTTCCCAGATC 23 8 3 

CTCCCTGCCTGGCCTGCTCAGAGGTTCCCTGTTGGTAACCTGGCTTTATCAAATTCTCATCCCTTTCCCACACCCACTT 2462 

CTCTCCTATCACCTTCCCCCAAGATTACCTGAACAGGGTCCATGGCCACTCAACCTGTCAGCTTGCACCATCCCCACCT 2541 

GCCACCTACAGTCAGGCCACATGCCTGGTCACTGAATCATGCAAAACTGGCCTCAGTCCCTAAAAATGATGTGGAAAGG 2620 

AAAGCCCAGGATCTGACAATGAGCCCTGGTGGATTTGTGGGGAAAAAATACACAGCACTCCCCACCTTTCTTTCGTTCA 2699 

TCTCCAGGGCCCCACCTCAGATCAAAGCAGCTCTGGATGAGATGGGACCTGCAGCTCTCCCTCCACAAGGTGACTCTTA 2778 

GCAACCTCATTTCGACAGTGGTTTGTAGCGTGGTGCACCAGGGCCTTGTTGAACAGATCCACACTGCTCTAATAAAGTT 2857 

2894 
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